(57) Abstract



cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and
the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5,
SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from essential oil plant species.
In another aspect, the present invention provides for isolated, recombinant plant DXPS proteins, such as the proteins having the sequences
set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided
which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion
of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells
are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence
encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the
aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification
in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of
1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives
such as isopentenyl diphosphate (IPP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate
synthase, or the production of its products.
NUCLEIC AND AMINO ACID SEQUENCES FOR A NOVEL TRANSKETOLASE FROM PEPPERMINT (MENTHA PIPERITA)

This invention was funded, in part, by U.S. Department of Energy Grant
No. DE-FG03-96ER20212. The United States government may have certain rights in
the invention.

Field of the Invention 
The present invention relates to nucleic acid sequences which code for a novel
transketolase from peppermint (Mentha x piperita), and to vectors containing the
sequences, host cells containing the sequences and methods of upregulating or
downregulating the production or activity of the transketolases and their mutants.

Background of the Invention
The isoprenoids comprise the largest family of natural products, with over
20,000 individual compounds described to date (Connolly, J.D. & Hill, R.A.,
Dictionary of Terpenoids (Chapman and Hall, London, 1991)). The isoprenoids play
numerous functional roles in plants as hormones (gibberellins, abscisic acid),
photosynthetic pigments (side chain of phytol, carotenoids), electron carriers (side
chain of plastoquinone), and structural components of membranes (phytosterols).
Isoprenoids also serve in communication and defense, for example as attractants for
pollinators and seed dispersers, and as competitive phytotoxins, antibiotics, and
herbivore repellents and toxins (Harborne, J.B. in Ecological Chemistry and
Biochemistry of Plant Terpenoids (Harborne, J.B., Tomas-Barberan, F.A., Eds.),
pp. 399-426 (Clarendon Press, Oxford, 1991)).

Until recently, it was generally assumed that all isoprenoids were synthesized
from acetyl-CoA via the classical mevalonate pathway (Spurgeon, S.L. & Porter,
J.W., Eds., in Biosynthesis of Isoprenoid Compounds, Vol. 1, pp. 1-46 (John Wiley,



WO 99/11757 PCT/US98/18231

New York, 1983)). However, in 1993, Rohmer and co-workers (Rohmer, M. et al.,
Biochem. J. 295:517-524 (1993)) demonstrated that a non-mevalonate pathway,
originating from pyruvate and glyceraldehyde-3-phosphate (GAP) (Rohmer, M. et al.,
J. Am. Chem. Soc. 118:2564-2566 (1996)), operated in several eubacteria, including
E. coli. Evidence subsequently emerged that the plastid-derived isoprenoids of plants,
including carotenoids and the prenyl side chains of chlorophyll and plastoquinone
(Lichtenthaler, H.K. et al., FEBS Lett. 400:271-274 (1997)), as well as isoprene
(Zeidler, J.G. et al., Z. Naturforsch. 52c:15-23 (1997)), monoterpenes (Eisenreich, W.
et al., Tetrahedron Lett. 38:3889-3892 (1997)) and diterpenes (Eisenreich, W. et al.,
Proc. Natl. Acad. Sci. USA 93:6431-6436 (1996)); (Schwarz, M.K., PhD thesis,
ETH, Zurich, Switzerland (1994)), are synthesized via the pyruvate/GAP route to
isopentenyl diphosphate (IPP). This new pathway had been completely overlooked in
the past.

The first dedicated reaction of this new enzymatic pathway to IPP is
considered to involve a transketolase-type condensation involving pyruvate and GAP
to form 1-deoxy-D-xylulose-5-phosphate (Rohmer, M. et al., J. Am. Chem. Soc.
118:2564-2566 (1996)); (Zeidler, J.G. et al., Z. Naturforsch. 52c:15-23 (1997));
(Broers, S.T.J., PhD thesis, ETH, Zurich, Switzerland (1994)) (FIGURE 1). A recent
abstract has described the cloning of a gene encoding 1-deoxyxylulose-5-phosphate
synthase from E. coli, but no sequence information, or other descriptive information,
was reported (Lois, L.M. et al., Third Terpnet Meeting of the European Network on
Plant Isoprenoids, Poitiers, France, May 29-30 (1997)).

Summary of the Invention 
In accordance with the present invention, isolated nucleic acid sequences, such
as the sequences set forth in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID
NO:5 and SEQ ID NO:7, which encode all or part of a 1-deoxyxylulose-5-phosphate
synthase (abbreviated as DXPS) from peppermint (Mentha x piperita), have been
isolated, identified and characterized. Thus, the present invention provides nucleic
acid sequences encoding essential oil plant 1-deoxyxylulose-5-phosphate synthase
proteins. In particular, the present invention provides nucleic acid sequences
encoding 1-deoxyxylulose-5-phosphate synthase proteins from the family Lamiaceae.
In another aspect the present invention provides for isolated, recombinant plant DXPS
proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ
ID NO:6 and SEQ ID NO:8. In other aspects, the present invention is directed to
replicable recombinant cloning vehicles comprising a nucleic acid sequence, e.g., a
DNA sequence, which codes for DXPS or for a base sequence sufficiently
complementary to at least a portion of the DXPS DNA or RNA to enable
hybridization therewith (e.g., antisense transketolase RNA or fragments of
complementary transketolase DNA which are useful as polymerase chain reaction
primers or as probes for transketolases from Mentha x piperita or related genes). In
yet other aspects of the invention, modified host cells are provided that have been
transformed, transfected, infected and/or injected with a recombinant cloning vehicle
and/or DNA sequence of the invention. Thus, the present invention provides for the
recombinant expression of the transketolase 1-deoxyxylulose-5-phosphate synthase
(DXPS) from peppermint (Mentha x piperita) and related transketolases from plants,
and the inventive concepts may be used to facilitate the production, isolation and
purification of significant quantities of recombinant plant transketolases (or of the
primary enzyme products) for subsequent use, such as to obtain expression or
enhanced expression of transketolases in plants to attain enhanced production of
predator or pathogen defense compounds, or may be otherwise employed in an
environment where the regulation or expression of transketolases is desired. In
other aspects, the invention provides for the regulation of isoprenoid biosynthesis in
plants by transforming, transfecting, infecting and/or injecting the plant with a
recombinant cloning vehicle and/or DNA sequence of the invention to obtain
expression of the transketolase DXPS or a related transketolase in the plant and
thereby upregulate the pyruvate/glyceraldehyde-3-phosphate isoprenoid biosynthetic
pathway. Thus, in addition to the new nucleic acid sequences and fragments thereof,
the present invention includes new vectors containing the sequences, host cells
containing the sequences, isolated recombinant transketolase (synthase) polypeptides
and methods of producing recombinant transketolases and their mutants.

Brief Description of the Drawings 
The foregoing aspects and many of the attendant advantages of this invention
will be better understood by reference to the following detailed description, when
taken in conjunction with the accompanying drawings, wherein:

FIGURE 1 is a schematic representation of the enzymatic pathway from
pyruvate to isopentenyl diphosphate (IPP) as catalyzed by 1-deoxyxylulose-5-phosphate
synthase (DXPS). The addition of thiamin pyrophosphate (TPP)-activated
acetaldehyde, formed by decarboxylation of pyruvate, to C1 of
glyceraldehyde-3-phosphate (GAP) and subsequent loss of TPP yields
1-deoxyxylulose-5-phosphate, which ultimately gives rise to isopentenyl
diphosphate (IPP).
FIGURE 2A shows a GC-MS analysis of the product formed by the
recombinant DXPS peppermint enzyme. Mass fragmentation patterns are illustrated
for the biosynthetic product after dephosphorylation and trimethylsilylation
(Rt = 6.71 min).

FIGURE 2B shows a GC-MS analysis of the silylated derivative of authentic
1-deoxy-D-xylulose (Rt = 6.70 min).

FIGURE 3 is a graphical representation of the time-course of relative steady-state
DXPS mRNA levels (-•-) and rate of monoterpene biosynthesis as measured by
14CO2 incorporation (-○-) during leaf development in peppermint. Total RNA was
isolated from oil gland secretory cells of leaves of different developmental stages. A
32P-labeled probe derived from DXPS clone pDS29 (SEQ ID NO:5) detected a
transcript of about 3 kb. Leaves are fully expanded by two weeks, and high rates of
monoterpene biosynthesis and high steady-state levels of DXPS mRNA are observed
only during early leaf development (< 7 d).

FIGURE 4 shows the results of clustering relationship analysis based on
sequence comparisons carried out using GCG version 9.0 of the University of
Wisconsin Genetics Computer Group Package (1997). The following transketolase
sequences are included: DXPS (Mentha x piperita, accession number AF019383),
CLA1 (Arabidopsis thaliana, U27099), ORF2814 (Rhodobacter capsulata, [accession number illegible]),
ORFf620 (Escherichia coli, U82664), a protein of unknown function (Synechocystis
sp. PC6803, D90903), transketolase 1 (human, A45050; yeast, P23254; Escherichia
coli, P27302), transketolase 2 (human, P51854; yeast, P33315; Escherichia coli,
P33570), and a plastidial transketolase from potato (Z50099).

Detailed Description of the Preferred Embodiment 

As used herein, the terms "amino acid" and "amino acids" refer to all naturally
occurring L-α-amino acids or their residues. The amino acids are identified by either
the single-letter or three-letter designations:

Asp   D   aspartic acid      Ile   I   isoleucine
Thr   T   threonine          Leu   L   leucine
Ser   S   serine             Tyr   Y   tyrosine
Glu   E   glutamic acid      Phe   F   phenylalanine
Pro   P   proline            His   H   histidine
Gly   G   glycine            Lys   K   lysine
Ala   A   alanine            Arg   R   arginine
Cys   C   cysteine           Trp   W   tryptophan
Val   V   valine             Gln   Q   glutamine
Met   M   methionine         Asn   N   asparagine

As used herein, the term "nucleotide" means a monomeric unit of DNA or
RNA containing a sugar moiety (pentose), a phosphate and a nitrogenous heterocyclic
base. The base is linked to the sugar moiety via the glycosidic carbon (1' carbon of
pentose) and that combination of base and sugar is called a nucleoside. The base
characterizes the nucleotide, with the four bases of DNA being adenine ("A"), guanine
("G"), cytosine ("C") and thymine ("T"). Inosine ("I") is a synthetic base that can be
used to substitute for any of the four naturally-occurring bases (A, C, G or T). The
four RNA bases are A, G, C and uracil ("U"). The nucleotide sequences described
herein comprise a linear array of nucleotides connected by phosphodiester bonds
between the 3' and 5' carbons of adjacent pentoses.

"Oligonucleotide" refers to short length single or double stranded sequences of 
deoxyribonucleotides linked via phosphodiester bonds. The oligonucleotides are 
chemically synthesized by known methods and purified, for example, on
polyacrylamide gels.

The term "1-deoxyxylulose-5-phosphate synthase" (abbreviated as "DXPS") is
used herein to mean an enzyme capable of catalyzing a transketolase-type
condensation involving pyruvate and glyceraldehyde-3-phosphate (GAP) to form
1-deoxy-D-xylulose-5-phosphate. This reaction is schematically set forth in FIGURE 1.

The term "essential oil plant," or "essential oil plants," refers to a group of
plant species that produce high levels of monoterpenoid and/or sesquiterpenoid and/or
diterpenoid oils, and/or high levels of monoterpenoid and/or sesquiterpenoid and/or
diterpenoid resins. The foregoing oils and/or resins account for greater than about
0.005% of the fresh weight of an essential oil plant that produces them. The essential
oils and/or resins are more fully described, for example, in E. Guenther, The Essential
Oils, Vols. I-VI, R.E. Krieger Publishing Co., Huntington N.Y., 1975, incorporated
herein by reference. The essential oil plants include, but are not limited to:

Lamiaceae, including, but not limited to, the following species: Ocimum
(basil), Lavandula (lavender), Origanum (oregano), Mentha (mint), Salvia (sage),
Rosmarinus (rosemary), Thymus (thyme), Satureja and Monarda.

Umbelliferae, including, but not limited to, the following species: Carum
(caraway), Anethum (dill), Foeniculum (fennel) and Daucus (carrot).
Asteraceae (Compositae), including, but not limited to, the following species:
Artemisia (tarragon, sage brush), Tanacetum (tansy).

Rutaceae (e.g., citrus plants); Rosaceae (e.g., roses); Myrtaceae (e.g.,
eucalyptus, Melaleuca); the Gramineae (e.g., Cymbopogon (citronella)); Geraniaceae
(Geranium) and certain conifers including Abies (e.g., Canadian balsam), Cedrus
(cedar) and Thuja and Juniperus.

The range of essential oil plants is more fully set forth in E. Guenther, The
Essential Oils, Vols. I-VI, R.E. Krieger Publishing Co., Huntington N.Y., 1975,
which is incorporated herein by reference.

Abbreviations used are: bp, base pair; DMAPP, dimethylallyl diphosphate;
DXPS, 1-deoxyxylulose-5-phosphate synthase; GAP, glyceraldehyde-3-phosphate;
IPP, isopentenyl diphosphate; Mopso, 3-(N-morpholino)-2-hydroxypropane-sulfonic
acid; Tris, Tris-(hydroxymethyl)aminomethane; UTR, untranslated region; TLC, thin
layer chromatography; Tr, truncation site; GC, gas chromatography; Hepes,
hydroxyethylpiperazine-N'-2-ethanesulfonic acid; kb, kilobase pairs.

The abbreviation "SSC" refers to a buffer used in nucleic acid hybridization
solutions. One liter of the 20X (twenty times concentrate) stock SSC buffer solution
(pH 7.0) contains 175.3 g sodium chloride and 88.2 g sodium citrate.
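
As an illustration of the SSC recipe above, the following sketch converts the stated
masses into molar concentrations and computes how much 20X stock is needed for a
working dilution. The molar masses are standard values, but the use of trisodium
citrate dihydrate (294.10 g/mol) is an assumption, since the text says only "sodium
citrate"; the function names are hypothetical.

```python
# Molar masses in g/mol. The citrate value assumes the dihydrate salt,
# which the text does not specify.
NACL_G_PER_MOL = 58.44
TRISODIUM_CITRATE_DIHYDRATE_G_PER_MOL = 294.10


def molarity(grams_per_liter, molar_mass):
    """Return the molar concentration (mol/L) for a mass dissolved in one liter."""
    return grams_per_liter / molar_mass


def stock_volume_ml(final_volume_ml, stock_fold, target_fold):
    """Return the volume of stock (mL) needed to make final_volume_ml at target_fold."""
    if target_fold > stock_fold:
        raise ValueError("target concentration exceeds stock concentration")
    return final_volume_ml * target_fold / stock_fold


# Concentrations in the 20X stock described in the text.
nacl_20x = molarity(175.3, NACL_G_PER_MOL)
citrate_20x = molarity(88.2, TRISODIUM_CITRATE_DIHYDRATE_G_PER_MOL)

# Volume of 20X stock needed for 1 L of 2X SSC.
ml_for_2x = stock_volume_ml(1000, 20, 2)
```

Under these assumptions the 20X stock works out to roughly 3 M sodium chloride and
0.3 M sodium citrate.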

The terms "alteration", "amino acid sequence alteration", "variant" and "amino
acid sequence variant" refer to 1-deoxyxylulose-5-phosphate synthase molecules with
some differences in their amino acid sequences as compared to the corresponding,
native, i.e., naturally-occurring, 1-deoxyxylulose-5-phosphate synthases. Ordinarily,
the variants will possess at least about 70% homology with the corresponding native
1-deoxyxylulose-5-phosphate synthases, and preferably they will be at least
about 80% homologous with the corresponding, native 1-deoxyxylulose-5-phosphate
synthases. The amino acid sequence variants of the 1-deoxyxylulose-5-phosphate
synthases falling within this invention possess substitutions, deletions, and/or
insertions at certain positions. Sequence variants of 1-deoxyxylulose-5-phosphate
synthases may be used to attain desired enhanced or reduced enzymatic activity,
modified regiochemistry or stereochemistry, or altered substrate utilization or product
distribution.
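
The 70% and 80% thresholds above can be illustrated with a minimal sketch that
scores two pre-aligned sequences. It assumes that "homology" is measured as percent
identity over aligned columns, which is only one of several possible conventions and
is not specified in the text; the function names and the toy sequences are
hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns with identical residues.

    Columns where both sequences carry a gap are ignored; a residue
    opposite a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def meets_threshold(aligned_native, aligned_variant, threshold=70.0):
    """True if the variant is at least `threshold` percent identical to the native."""
    return percent_identity(aligned_native, aligned_variant) >= threshold


# Toy alignment: one substitution (I -> L) and one deletion (Q -> gap).
identity = percent_identity("MKTAYIAKQR", "MKTAYLAK-R")
```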

Substitutional 1-deoxyxylulose-5-phosphate synthase variants are those that
have at least one amino acid residue in the native 1-deoxyxylulose-5-phosphate
synthase sequence removed and a different amino acid inserted in its place at the same
position. The substitutions may be single, where only one amino acid in the molecule
has been substituted, or they may be multiple, where two or more amino acids have
been substituted in the same molecule. Substantial changes in the activity of the
1-deoxyxylulose-5-phosphate synthase molecules of the present invention may be
obtained by substituting an amino acid with a side chain that is significantly different
in charge and/or structure from that of the native amino acid. This type of
substitution would be expected to affect the structure of the polypeptide backbone
and/or the charge or hydrophobicity of the molecule in the area of the substitution.

Moderate changes in the activity of the 1-deoxyxylulose-5-phosphate synthase
molecules of the present invention would be expected by substituting an amino acid
with a side chain that is similar in charge and/or structure to that of the native
molecule. This type of substitution, referred to as a conservative substitution, would
not be expected to substantially alter either the structure of the polypeptide backbone
or the charge or hydrophobicity of the molecule in the area of the substitution.
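
The distinction between conservative and non-conservative substitutions can be
sketched as a lookup of side-chain classes. The grouping below is one common
textbook classification chosen for illustration; it is an assumption and is not
defined in the text, and the function names are hypothetical.

```python
# One common side-chain classification (an illustrative assumption).
SIDE_CHAIN_GROUPS = {
    "nonpolar": set("GAVLIMP"),
    "aromatic": set("FWY"),
    "polar_uncharged": set("STCNQ"),
    "positive": set("KRH"),
    "negative": set("DE"),
}


def group_of(residue):
    """Return the name of the side-chain group for a one-letter amino acid code."""
    for name, members in SIDE_CHAIN_GROUPS.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown amino acid code: {residue!r}")


def is_conservative(native, replacement):
    """True if both residues fall in the same side-chain group."""
    return group_of(native) == group_of(replacement)
```

For example, under this grouping an Asp-to-Glu change is classed as conservative,
whereas Asp-to-Lys reverses the charge and is not.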

Insertional 1-deoxyxylulose-5-phosphate synthase variants are those with one
or more amino acids inserted immediately adjacent to an amino acid at a particular
position in the native 1-deoxyxylulose-5-phosphate synthase molecule. Immediately
adjacent to an amino acid means connected to either the α-carboxy or α-amino
functional group of the amino acid. The insertion may be one or more amino acids.
Ordinarily, the insertion will consist of one or two conservative amino acids. Amino
acids similar in charge and/or structure to the amino acids adjacent to the site of
insertion are defined as conservative. Alternatively, this invention includes insertion
of an amino acid with a charge and/or structure that is substantially different from the
amino acids adjacent to the site of insertion.

Deletional variants are those where one or more amino acids in the native
1-deoxyxylulose-5-phosphate synthase molecules have been removed. Ordinarily,
deletional variants will have one or two amino acids deleted in a particular region of
the 1-deoxyxylulose-5-phosphate synthase molecule.
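
The three variant classes defined above (substitutional, insertional and deletional)
can be sketched as simple string operations on a one-letter amino acid sequence.
The helper names, the 1-based position convention and the toy peptide are
assumptions made for illustration only.

```python
def _check(seq, position):
    """Validate a 1-based residue position against the sequence length."""
    if not 1 <= position <= len(seq):
        raise ValueError(f"position {position} outside 1..{len(seq)}")


def substitute(seq, position, new_residue):
    """Replace the residue at a 1-based position with a different residue."""
    _check(seq, position)
    i = position - 1
    return seq[:i] + new_residue + seq[i + 1:]


def insert_after(seq, position, residues):
    """Insert residues on the carboxy side of the residue at a 1-based position."""
    _check(seq, position)
    return seq[:position] + residues + seq[position:]


def delete(seq, position, count=1):
    """Remove `count` residues starting at a 1-based position."""
    _check(seq, position)
    i = position - 1
    return seq[:i] + seq[i + count:]


native = "MASTK"  # toy peptide, not taken from any SEQ ID NO
substitutional = substitute(native, 3, "T")
insertional = insert_after(native, 2, "G")
deletional = delete(native, 4, 2)
```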

The terms "biological activity", "biologically active", "activity" and "active"
refer to the ability of the 1-deoxyxylulose-5-phosphate synthases of the present
invention to catalyze a transketolase-type condensation involving pyruvate and
glyceraldehyde-3-phosphate (GAP) to form 1-deoxy-D-xylulose-5-phosphate. This
reaction is schematically set forth in FIGURE 1. 1-Deoxyxylulose-5-phosphate
synthase activity is measured in an enzyme activity assay, such as the assay described
in Example 2. Amino acid sequence variants of the 1-deoxyxylulose-5-phosphate
synthases of the present invention may have desirable altered biological activity
including, for example, altered reaction kinetics, substrate utilization, product
distribution or other characteristics such as regiochemistry and stereochemistry.

The terms "DNA sequence encoding", "DNA encoding" and "nucleic acid
encoding" refer to the order or sequence of deoxyribonucleotides along a strand of
deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order
of amino acids along the translated polypeptide chain. The DNA sequence thus codes
for the amino acid sequence.
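
The relationship between nucleotide order and amino acid order can be illustrated
with a minimal translation sketch using the standard genetic code. The function name
and the short input sequence are hypothetical; the example is not one of the
sequences of the invention.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA strand from its first base to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


peptide = translate("ATGGCTTCTTAA")
```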

The terms "replicable expression vector" and "expression vector" refer to a
piece of DNA, usually double-stranded, which may have inserted into it a piece of
DNA (the insert DNA) from another source. The vector is used to transport the
insert DNA into a suitable host cell. The insert DNA may be derived from the host
cell, or may be derived from a different cell or organism. Once in the host cell, the
vector can replicate independently of or coincidental with the host chromosomal
DNA, and several copies of the vector and its inserted DNA may be generated. In
addition, the vector contains the necessary elements that permit translating the insert
DNA into a polypeptide. Many molecules of the polypeptide encoded by the insert
DNA can thus be rapidly synthesized.

The terms "transformed host cell," "transformed" and "transformation" refer to
the introduction of DNA into a cell. The cell is termed a "host cell", and it may be a
prokaryotic or a eukaryotic cell. Typical prokaryotic host cells include various strains
of E. coli. Typical eukaryotic host cells are plant cells, such as maize cells, yeast
cells, insect cells or animal cells. The introduced DNA is usually in the form of a
vector containing an inserted piece of DNA. The introduced DNA sequence may be
from the same species as the host cell or from a different species from the host cell, or
it may be a hybrid DNA sequence, containing some foreign DNA and some DNA
derived from the host species.

In accordance with the present invention, total RNA was extracted from
secretory cells which had been isolated from 5-day-old peppermint leaves. Poly(A)+
RNA was purified by chromatography on oligo(dT)-cellulose (Pharmacia), and 5 μg
of the resulting mRNA was utilized to construct a λZAP cDNA library. One hundred
and fifty randomly picked and purified clones were in vivo-excised and the resulting
phagemids were sequenced using T3 and T7 primers. Two clones (designated pDS1
and pDS2) were identified which exhibited significant sequence similarity to a recently
described Arabidopsis thaliana transketolase gene of unknown function (CLA1). The
nucleotide sequence of the cDNA insert of pDS1 is set forth in SEQ ID NO:1, and the
nucleotide sequence of the cDNA insert of pDS2 is set forth in SEQ ID NO:2.

A set of 3,000 plaques was then screened with a nucleic acid probe (SEQ ID
NO:9) derived by PCR from the cDNA insert (SEQ ID NO:1) of pDS1. This
procedure afforded 47 positive signals under high-stringency hybridization conditions.
After one additional cycle of hybridization, the positive clones were in vivo-excised,
the insert sizes were determined by PCR, and the 20 largest clones were partially
sequenced. Three of these clones (designated pDS16, pDS29 and pDS39) appeared
to be full-length and were entirely sequenced on both strands. The nucleotide
sequence of the cDNA insert of pDS16 is set forth in SEQ ID NO:3, the nucleotide
sequence of the cDNA insert of pDS29 is set forth in SEQ ID NO:5 and the
nucleotide sequence of the cDNA insert of pDS39 is set forth in SEQ ID NO:7.

The cDNA insert of DXPS clone pDS29 (SEQ ID NO:5), which yielded the
highest expressed level of synthase activity, contains an open reading frame (ORF) of
2172 nucleotides. The first 70 deduced amino acid residues (amino acid residues 1-70
of SEQ ID NO:6) show the general characteristics of plastidial targeting sequences,
consistent with the proposed subcellular location of the enzyme in plant cells. By
excluding the putative transit peptide residues, the sequence corresponds to a mature
protein of about 650 amino acids, with a predicted size of roughly 71 kDa. An
alignment of translated transketolase sequences (devoid of plastid-targeting peptides
where appropriate) shows very high similarity/identity values between the peppermint
DXPS and several other transketolases.
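
The figures quoted above can be checked with a short back-of-the-envelope
calculation. This sketch assumes that the 2172-nucleotide ORF length includes the
stop codon and uses a rule-of-thumb average residue mass of 110 Da; neither
assumption is stated in the text, and the variable names are hypothetical.

```python
ORF_NUCLEOTIDES = 2172        # from the text (pDS29, SEQ ID NO:5)
TRANSIT_PEPTIDE_RESIDUES = 70  # from the text (residues 1-70 of SEQ ID NO:6)
AVERAGE_RESIDUE_MASS_DA = 110  # rule-of-thumb value (assumption)

# An ORF must contain a whole number of codons.
codons = ORF_NUCLEOTIDES // 3

# Assumption: the ORF length counts the stop codon, which encodes no residue.
preprotein_residues = codons - 1

# Removing the putative transit peptide gives the mature protein.
mature_residues = preprotein_residues - TRANSIT_PEPTIDE_RESIDUES

# Approximate molecular mass of the mature protein in kDa.
mature_mass_kda = mature_residues * AVERAGE_RESIDUE_MASS_DA / 1000
```

Under these assumptions the mature protein comes to 653 residues and close to
72 kDa, in line with the "about 650 amino acids" and "roughly 71 kDa" stated above.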

E. coli cultures transfected with phagemids derived from pDS16, including the
cDNA sequence set forth in SEQ ID NO:3, pDS29, including the cDNA sequence set
forth in SEQ ID NO:5, and pDS39, including the cDNA sequence set forth in SEQ ID
NO:7, were each induced with isopropyl-1-thio-β-D-galactopyranoside (IPTG), the
corresponding bacterial cells were harvested and homogenized, and the extracts were
assayed using [2-14C]pyruvate and DL-glyceraldehyde-3-phosphate as co-substrates.
Preparations from E. coli cells transfected with pDS29, including the cDNA sequence
set forth in SEQ ID NO:5, and pDS39, including the cDNA sequence set forth in
SEQ ID NO:7, yielded a prominent new radioactive component in the reaction
mixture that, upon reversed-phase ion-pair radio-HPLC, exhibited an Rt (35.5 min)
consistent with that of a sugar (pentulose) phosphate. The same enzymatic product
was generated with D-glyceraldehyde-3-phosphate as co-substrate, indicating that the
D-antipode is the likely natural substrate of the functionally expressed transketolase.
The presumptive pentulose phosphate product from preparative enzyme incubations
(E. coli transformed with pDS29, including the cDNA sequence set forth in SEQ ID
NO:5) was purified by HPLC and hydrolyzed with acid phosphatase, and the resulting
sugar was silylated. This derivatized product of the recombinant enzyme was then
analyzed by combined capillary GC-MS and shown to possess the identical retention
time (6.71 ± 0.03 min) and mass spectrum as that of an authentic sample of silylated
1-deoxy-D-xylulose (FIGURES 2A and 2B). The combined evidence thus indicated
that a cDNA encoding 1-deoxyxylulose-5-phosphate synthase (DXPS) had been
acquired. DXPS activity was significantly higher in the IPTG-induced E. coli cells
expressing the cDNA insert of pDS29 (SEQ ID NO:5), when compared to identically
treated cells containing the same plasmid devoid of the cDNA insert set forth in SEQ
ID NO:5 (7-fold higher than endogenous activity, n = 7, p < 0.01).

RNA blot analyses showed a direct correlation between steady-state levels of
the DXPS message and monoterpene production, as determined by 14CO2
incorporation, thus suggesting activation of the non-mevalonate pathway to supply
the IPP precursor for subsequent monoterpene biosynthesis in peppermint oil glands
(FIGURE 3).

The isolation of full-length cDNAs (SEQ ID NO:3, SEQ ID NO:5, SEQ ID
NO:7) encoding 1-deoxyxylulose-5-phosphate synthase permits the development of
efficient expression systems for this functional enzyme; provides useful tools for
examining the developmental regulation of DXPS; permits investigation of the
reaction mechanism(s) of this enzyme; and permits the isolation of other
1-deoxyxylulose-5-phosphate synthases. The isolation of full-length
1-deoxyxylulose-5-phosphate synthase cDNAs (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) also
permits the transformation of a wide range of organisms in order to enhance, or
otherwise alter, the synthesis of 1-deoxyxylulose-5-phosphate, and of its derivatives,
such as IPP.

Although the full-length 1-deoxyxylulose-5-phosphate synthase proteins set
forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8 direct the enzyme to plastids,
substitution of the presumptive targeting sequence of this enzyme (e.g., SEQ ID
NO:3, amino acid residue numbers 1 to 70) with other transport sequences well
known in the art (see, e.g., von Heijne et al., Eur. J. Biochem., 180:535-545, 1989;
Stryer, Biochemistry, W.H. Freeman and Company, New York, NY, p. 769 [1988])
may be employed to direct 1-deoxyxylulose-5-phosphate synthase to other cellular or
extracellular locations.
In addition to the native 1-deoxyxylulose-5-phosphate synthase amino acid
sequences of SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, sequence variants
produced by deletions, substitutions, mutations and/or insertions are intended to be
within the scope of the invention except insofar as limited by the prior art. The
1-deoxyxylulose-5-phosphate synthase amino acid sequence variants of this invention
may be constructed by mutating the DNA sequences that encode the wild-type
synthases, such as by using techniques commonly referred to as site-directed
mutagenesis. Nucleic acid molecules encoding the 1-deoxyxylulose-5-phosphate
synthases of the present invention can be mutated by a variety of PCR techniques well
known to one of ordinary skill in the art. (See, for example, the following
publications, the cited portions of which are incorporated by reference herein: "PCR
Strategies", M.A. Innis, D.H. Gelfand and J.J. Sninsky, eds., 1995, Academic Press,
San Diego, CA (Chapter 14); "PCR Protocols: A Guide to Methods and
Applications", M.A. Innis, D.H. Gelfand, J.J. Sninsky and T.J. White, eds., Academic
Press, NY (1990).)

By way of non-limiting example, the two primer system utilized in the
Transformer Site-Directed Mutagenesis kit from Clontech may be employed for
introducing site-directed mutants into the 1-deoxyxylulose-5-phosphate synthase
genes of the present invention. Following denaturation of the target plasmid in this
system, two primers are simultaneously annealed to the plasmid; one of these primers
contains the desired site-directed mutation, the other contains a mutation at another
point in the plasmid resulting in elimination of a restriction site. Second strand
synthesis is then carried out, tightly linking these two mutations, and the resulting
plasmids are transformed into a mutS strain of E. coli. Plasmid DNA is isolated from
the transformed bacteria, restricted with the relevant restriction enzyme (thereby
linearizing the unmutated plasmids), and then retransformed into E. coli. This system
allows for generation of mutations directly in an expression plasmid, without the
necessity of subcloning or generation of single-stranded phagemids. The tight linkage
of the two mutations and the subsequent linearization of unmutated plasmids result in
high mutation efficiency and allow minimal screening. Following synthesis of the
initial restriction site primer, this method requires the use of only one new primer type
per mutation site. Rather than prepare each positional mutant separately, a set of
"designed degenerate" oligonucleotide primers can be synthesized in order to
introduce all of the desired mutations at a given site simultaneously. Transformants
can be screened by sequencing the plasmid DNA through the mutagenized region to
identify and sort mutant clones. Each mutant DNA can then be fully sequenced or
restricted and analyzed by electrophoresis on Mutation Detection Enhancement gel
(J.T. Baker) to confirm that no other alterations in the sequence have occurred (by
band shift comparison to the unmutagenized control).
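
The idea of a "designed degenerate" primer set can be sketched by expanding a
primer written with IUPAC ambiguity codes into every concrete sequence it
represents. The ambiguity codes are the standard IUPAC nucleotide codes; the
function name and the example primers are hypothetical and do not correspond to any
primer used with the kit described above.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer):
    """Return every unambiguous sequence encoded by a degenerate primer.

    Raises KeyError if the primer contains a character that is not an
    IUPAC nucleotide code.
    """
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# A GAR codon covers both glutamate codons (GAA and GAG).
glu_codons = expand_degenerate("GAR")

# An NNK codon at one site yields 4 * 4 * 2 = 32 variants.
nnk_variants = expand_degenerate("NNK")
```

Because the number of sequences grows multiplicatively with each ambiguous
position, it can be worth counting the variants before ordering a primer pool.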
5 Again, by way of non-limiting example, the two primer system utilized in the 

QuikChange™ Site-Directed Mutagenesis kit from Stratagene (LaJolla, California), 
may be employed for introducing site-directed mutants into the l-deoxyxylulose-5- 
phosphate synthase genes of the present invention. Double-stranded plasmid DNA, 
containing the insert bearing the target mutation site, is denatured and mixed with two 

10 oligonucleotides complementary to each of the strands of the plasmid DNA at the 
target mutation site. The annealed oligonucleotide primers are extended using Pfu 
DNA polymerase, thereby generating a mutated plasmid containing staggered nicks. 
After temperature cycling, the unmutated, parental DNA template is digested with 
restriction enzyme DpnI, which cleaves methylated or hemimethylated DNA, but
which does not cleave unmethylated DNA. The parental, template DNA is almost
always methylated or hemimethylated since most strains of E. coli, from which the
template DNA is obtained, contain the required methylase activity. The remaining, 
annealed vector DNA incorporating the desired mutation(s) is transformed into E. 
coli. 

The sequence-verified mutant duplexes in the pET (or other) overexpression
vector can be employed to transform E. coli, such as strain E. coli BL21(DE3)pLysS,
for high level production of the mutant protein, and purification by standard 
protocols. The method of FAB-MS mapping can be employed to rapidly check the 
fidelity of mutant expression. This technique provides for sequencing segments
throughout the whole protein and provides the necessary confidence in the sequence
assignment. In a mapping experiment of this type, protein is digested with a protease 
(the choice will depend on the specific region to be modified since this segment is of 
prime interest and the remaining map should be identical to the map of unmutagenized 
protein). The set of cleavage fragments is fractionated by microbore HPLC (reversed
phase or ion exchange, again depending on the specific region to be modified) to
provide several peptides in each fraction, and the molecular weights of the peptides 
are determined by FAB-MS. The masses are then compared to the molecular weights 
of peptides expected from the digestion of the predicted sequence, and the correctness 
of the sequence quickly ascertained. Since this mutagenesis approach to protein
modification is directed, sequencing of the altered peptide should not be necessary if
the MS agrees with prediction. If necessary to verify a changed residue, CAD-tandem
MS/MS can be employed to sequence the peptides of the mixture in question, or the 
target peptide purified for subtractive Edman degradation or carboxypeptidase Y 
digestion depending on the location of the modification. 
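
The mass-mapping comparison described above can be sketched in Python as follows. This is only an illustration under stated assumptions: trypsin is assumed as the protease (cleavage after Lys or Arg, except before Pro), neutral monoisotopic masses are computed rather than the ion masses an instrument would report, and the two short sequences and all function names are hypothetical.

```python
# Monoisotopic residue masses (Da) of the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # added once per peptide for the free N- and C-termini


def digest(protein: str) -> list[str]:
    """In silico tryptic digest: cut after K or R unless followed by P."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        following = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and following != "P":
            peptides.append(protein[start : i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def peptide_mass(peptide: str) -> float:
    """Neutral monoisotopic mass of a linear peptide."""
    return sum(RESIDUE_MASS[residue] for residue in peptide) + WATER


def changed_peptides(wild_type: str, mutant: str, tol: float = 0.01) -> list[str]:
    """Mutant peptides whose mass matches no wild-type peptide within tol."""
    wt_masses = [peptide_mass(p) for p in digest(wild_type)]
    return [
        p for p in digest(mutant)
        if all(abs(peptide_mass(p) - m) > tol for m in wt_masses)
    ]


if __name__ == "__main__":
    wild_type = "MAKCGELRSTK"   # hypothetical sequence
    mutant = "MAKAGELRSTK"      # Cys -> Ala at position 4
    for p in changed_peptides(wild_type, mutant):
        print(p, round(peptide_mass(p), 4))
```

Only the peptide carrying the substitution should differ from the wild-type map; every other mass should be unchanged, which is the consistency check the text describes.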

In the design of a particular site directed mutagenesis, it is generally desirable 
to first make a non-conservative substitution (e.g., Ala for Cys, His or Glu) and 
determine if activity is greatly impaired as a consequence. The properties of the 
mutagenized protein are then examined with particular attention to the kinetic 
parameters of Km and kcat as sensitive indicators of altered function, from which
changes in binding and/or catalysis per se may be deduced by comparison to the 
native enzyme. If the residue is by this means demonstrated to be important by 
activity impairment, or knockout, then conservative substitutions can be made, such 
as Asp for Glu to alter side chain length, Ser for Cys, or Arg for His. 

Other site directed mutagenesis techniques may also be employed with the 
nucleotide sequences of the invention. For example, restriction endonuclease 
digestion of DNA followed by ligation may be used to generate deletion variants of 1-
deoxyxylulose-5-phosphate synthase, as described in section 15.3 of Sambrook et al.,
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory 
Press, New York, NY [1989], incorporated herein by reference. A similar strategy 
may be used to construct insertion variants, as described in section 15.3 of 
Sambrook et al., supra. 

Oligonucleotide-directed mutagenesis may also be employed for preparing 
substitution variants of this invention. It may also be used to conveniently prepare the 
deletion and insertion variants of this invention. This technique is well known in the 
art as described by Adelman et al. (DNA 2:183 [1983]); Sambrook et al., supra;
"Current Protocols in Molecular Biology", 1991, Wiley (NY), F.T. Ausubel, R. Brent,
R.E. Kingston, D.D. Moore, J.D. Seidman, J.A. Smith and K. Struhl, eds, 
incorporated herein by reference. 

Generally, oligonucleotides of at least 25 nucleotides in length are used to 
insert, delete or substitute two or more nucleotides in the 1-deoxyxylulose-5-
phosphate synthase molecule. An optimal oligonucleotide will have 12 to 15 perfectly
matched nucleotides on either side of the nucleotides coding for the mutation. To
mutagenize wild-type 1-deoxyxylulose-5-phosphate synthase, the oligonucleotide is
annealed to the single-stranded DNA template molecule under suitable hybridization 
conditions. A DNA polymerizing enzyme, usually the Klenow fragment of E. coli
DNA polymerase I, is then added. This enzyme uses the oligonucleotide as a primer
to complete the synthesis of the mutation-bearing strand of DNA. Thus, a
heteroduplex molecule is formed such that one strand of DNA encodes the wild-type
synthase inserted in the vector, and the second strand of DNA encodes the mutated
form of the synthase inserted into the same vector. This heteroduplex molecule is
then transformed into a suitable host cell. 
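
The oligonucleotide layout described above (12 to 15 perfectly matched nucleotides on either side of the mutated codon) can be sketched in Python as follows. The template is a hypothetical 16-codon sequence, not one of the sequences of the invention, and the function names are illustrative. The oligonucleotide is written in the sense of the coding strand; in practice the strand complementary to the single-stranded template would be synthesized.

```python
def design_mutagenic_oligo(template: str, codon_start: int,
                           new_codon: str, flank: int = 15) -> str:
    """Return flank + new codon + flank, taken from the coding strand."""
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three nucleotides")
    if not 12 <= flank <= 15:
        raise ValueError("flank should be 12 to 15 matched nucleotides")
    codon_end = codon_start + 3
    if codon_start - flank < 0 or codon_end + flank > len(template):
        raise ValueError("not enough template on one side of the codon")
    left = template[codon_start - flank : codon_start]
    right = template[codon_end : codon_end + flank]
    return left + new_codon + right


def mismatches(template: str, codon_start: int, new_codon: str) -> int:
    """Number of nucleotides changed relative to the wild-type codon."""
    old_codon = template[codon_start : codon_start + 3]
    return sum(a != b for a, b in zip(old_codon, new_codon))


# Hypothetical template, written codon by codon; the TGC (Cys) codon
# at index 21 is replaced by GCC (Ala).
TEMPLATE = "".join([
    "ATG", "GCT", "TCA", "GGT", "GAA", "CTG", "CGT", "TGC",
    "CAT", "AAA", "GGT", "CTG", "GAA", "ACC", "GTT", "TAA",
])

if __name__ == "__main__":
    oligo = design_mutagenic_oligo(TEMPLATE, 21, "GCC")
    print(oligo, len(oligo), mismatches(TEMPLATE, 21, "GCC"))
```

With 15-nucleotide flanks the oligonucleotide is 33 nucleotides long, which satisfies the stated minimum of 25.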

Mutants with more than one amino acid substituted may be generated in one 
of several ways. If the amino acids are located close together in the polypeptide 
chain, they may be mutated simultaneously using one oligonucleotide that codes for
all of the desired amino acid substitutions. If, however, the amino acids are located
some distance from each other (separated by more than ten amino acids, for example) 
it is more difficult to generate a single oligonucleotide that encodes all of the desired 
changes. Instead, one of two alternative methods may be employed. In the first 
method, a separate oligonucleotide is generated for each amino acid to be substituted.
The oligonucleotides are then annealed to the single-stranded template DNA
simultaneously, and the second strand of DNA that is synthesized from the template 
will encode all of the desired amino acid substitutions. An alternative method 
involves two or more rounds of mutagenesis to produce the desired mutant. The first 
round is as described for the single mutants: wild-type 1-deoxyxylulose-5-phosphate
synthase DNA is used for the template, an oligonucleotide encoding the first desired
amino acid substitution(s) is annealed to this template, and the heteroduplex DNA
molecule is then generated. The second round of mutagenesis utilizes the mutated 
DNA produced in the first round of mutagenesis as the template. Thus, this template 
already contains one or more mutations. The oligonucleotide encoding the additional
desired amino acid substitution(s) is then annealed to this template, and the resulting
strand of DNA now encodes mutations from both the first and second rounds of 
mutagenesis. This resultant DNA can be used as a template in a third round of 
mutagenesis, and so on. 
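
The choice between one oligonucleotide and several can be sketched as a simple grouping of substitution positions. The following Python fragment is an illustration only: it assumes the ten-residue spacing mentioned above as the cut-off, measures the gap from the previous position in the same group, and uses hypothetical residue numbers.

```python
def plan_oligos(positions: list[int], max_gap: int = 10) -> list[list[int]]:
    """Group residue positions so each group can share one oligonucleotide.

    Positions separated from the previous one by more than max_gap
    residues start a new group (a separate oligonucleotide, or a
    separate round of mutagenesis).
    """
    groups: list[list[int]] = []
    for pos in sorted(set(positions)):
        if groups and pos - groups[-1][-1] <= max_gap:
            groups[-1].append(pos)
        else:
            groups.append([pos])
    return groups


if __name__ == "__main__":
    # Hypothetical residue numbers targeted for substitution.
    print(plan_oligos([45, 48, 120, 52, 300]))
```

Each resulting group corresponds either to one of several oligonucleotides annealed simultaneously or to one round in a multi-round scheme.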

A gene encoding 1-deoxyxylulose-5-phosphate synthase may be incorporated
into any organism (intact plant, animal, microbe, etc.), or cell culture derived
therefrom. A 1-deoxyxylulose-5-phosphate synthase gene may be introduced into any
organism for a variety of purposes including, but not limited to: production of 1-
deoxyxylulose-5-phosphate synthase, or its product 1-deoxyxylulose-5-phosphate;
enhancement of the rate of production and/or the absolute amount of one or more
chemical compounds, such as IPP, derived from 1-deoxyxylulose-5-phosphate; and
augmenting the disease resistance of plants by enhancing the production of
terpenoid(s) having defensive, e.g., antimicrobial or antifeedant, properties.
Additionally, a gene encoding all or part of a 1-deoxyxylulose-5-phosphate synthase
can be introduced, in antisense orientation, into any plant species in order to reduce
the amount of 1-deoxyxylulose-5-phosphate synthase produced in the plant, thereby
reducing the production of terpenoids. 
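
Placing a fragment in antisense orientation amounts to inserting its reverse complement downstream of the promoter, so that the transcript is complementary to the endogenous mRNA. A minimal Python sketch follows; the function names and the six-nucleotide example fragment are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA sequence (A/C/G/T only)."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_insert(cdna_fragment: str) -> str:
    """Sequence to place after the promoter for antisense expression."""
    return reverse_complement(cdna_fragment)


if __name__ == "__main__":
    print(antisense_insert("ATGGCC"))
```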

Additionally, because the novel pathway involving the DXPS enzymes of the 
present invention is present in certain bacteria and plants, but not in animals, it 
provides a new molecular target for the design of highly specific antibiotics and
herbicides. Thus, analysis of the structure and catalytic mechanism of DXPS proteins
of the present invention will lead to the development of selective inhibitors of this 
enzyme having antibiotic and/or herbicidal activity. Once selective inhibitors of the 
DXPS proteins of the present invention have been identified, mutagenesis of the 
nucleic acid molecules encoding DXPS proteins of the present invention will yield
inhibitor-resistant DXPS proteins. Nucleic acid sequences encoding these inhibitor-
resistant DXPS proteins can be introduced into plants, thereby producing transgenic
plants that are resistant to the DXPS inhibitors. Thus, for example, transgenic grass 
species used in lawns can be transformed with a nucleic acid sequence encoding 
inhibitor-resistant DXPS protein; thereafter treatment of the lawn with a DXPS
inhibitor will kill unwanted weeds, but leave the transformed, inhibitor-resistant grass
unharmed. 

Eukaryotic expression systems may be utilized for the production of 1-
deoxyxylulose-5-phosphate synthase since they are capable of carrying out any
required posttranslational modifications and of directing the enzyme to the proper
cellular compartment. A representative eukaryotic expression system for this purpose
uses the recombinant baculovirus, Autographa californica nuclear polyhedrosis virus
(AcNPV; M.D. Summers and G.E. Smith, A Manual of Methods for Baculovirus
Vectors and Insect Cell Culture Procedures [1986]; Luckow et al., Bio/Technology,
6:47-55 [1987]) for expression of the 1-deoxyxylulose-5-phosphate synthases of the
invention. Infection of insect cells (such as cells of the species Spodoptera
frugiperda) with the recombinant baculoviruses allows for the production of large
amounts of the 1-deoxyxylulose-5-phosphate synthase proteins. In addition, the
baculovirus system has other important advantages for the production of recombinant
1-deoxyxylulose-5-phosphate synthase. For example, baculoviruses do not infect
humans and can therefore be safely handled in large quantities. In the baculovirus
system, a DNA construct is prepared including a DNA segment encoding 1-
deoxyxylulose-5-phosphate synthase and a vector. The vector may comprise the
polyhedron gene promoter region of a baculovirus, the baculovirus flanking sequences 
necessary for proper cross-over during recombination (the flanking sequences 
comprise about 200-300 base pairs adjacent to the promoter sequence) and a bacterial
origin of replication which permits the construct to replicate in bacteria. The vector is
constructed so that (i) the DNA segment is placed adjacent (or operably linked or
"downstream" or "under the control of") to the polyhedron gene promoter and (ii) the
promoter/1-deoxyxylulose-5-phosphate synthase combination is flanked on both sides
by 200-300 base pairs of baculovirus DNA (the flanking sequences).

To produce the 1-deoxyxylulose-5-phosphate synthase DNA construct, a
cDNA clone encoding the full length 1-deoxyxylulose-5-phosphate synthase is
obtained using methods such as those described herein. The DNA construct is
contacted in a host cell with baculovirus DNA of an appropriate baculovirus (that is,
of the same species of baculovirus as the promoter encoded in the construct) under
conditions such that recombination is effected. The resulting recombinant
baculoviruses encode the full 1-deoxyxylulose-5-phosphate synthase. For example, an
insect host cell can be cotransfected or transfected separately with the DNA construct
and a functional baculovirus. Resulting recombinant baculoviruses can then be
isolated and used to infect cells to effect production of the 1-deoxyxylulose-5-
phosphate synthase. Host insect cells include, for example, Spodoptera frugiperda
cells, that are capable of producing a baculovirus-expressed 1-deoxyxylulose-5-
phosphate synthase. Insect host cells infected with a recombinant baculovirus of the
present invention are then cultured under conditions allowing expression of the
baculovirus-encoded 1-deoxyxylulose-5-phosphate synthase. 1-Deoxyxylulose-5-
phosphate synthase thus produced is then extracted from the cells using methods
known in the art. 

Other eukaryotic microbes such as yeasts may also be used to practice this 
invention. The baker's yeast, Saccharomyces cerevisiae, is a commonly used yeast,
although several other strains are available. The plasmid YRp7 (Stinchcomb et al.,
Nature, 282:39 [1979]; Kingsman et al., Gene, 7:141 [1979]; Tschemper et al., Gene,
10:157 [1980]) is commonly used as an expression vector in Saccharomyces. This
plasmid contains the trp1 gene that provides a selection marker for a mutant strain of
yeast lacking the ability to grow without tryptophan, such as strains ATCC No. 44,076 and
PEP4-1 (Jones, Genetics, 85:12 [1977]). The presence of the trp1 lesion as a
characteristic of the yeast host cell genome then provides an effective environment for
detecting transformation by growth in the absence of tryptophan. Yeast host cells are
generally transformed using the polyethylene glycol method, as described by Hinnen
(Proc. Natl. Acad. Sci. USA, 75:1929 [1978]). Additional yeast transformation
protocols are set forth in Gietz et al., N.A.R., 20(17):1425 (1992); Reeves et al.,
FEMS, 99(2-3):193-197 (1992), both of which references are incorporated herein by
reference. 

Suitable promoting sequences in yeast vectors include the promoters for 
3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem., 255:2073 [1980]) or
other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 1:149 [1968],
Holland et al., Biochemistry, 17:4900 [1978]), such as enolase, glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. In the
construction of suitable expression plasmids, the termination sequences associated
with these genes are also ligated into the expression vector 3' of the sequence desired 
to be expressed to provide polyadenylation of the mRNA and termination. Other 
promoters that have the additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 2, isocytochrome C,
acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the
aforementioned glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible
for maltose and galactose utilization. Any plasmid vector containing yeast-compatible 
promoter, origin of replication and termination sequences is suitable. 

Cell cultures derived from multicellular organisms, such as plants, may be used
as hosts to practice this invention. Transgenic plants can be obtained, for example, by
transferring plasmids that encode 1-deoxyxylulose-5-phosphate synthase and a
selectable marker gene, e.g., the kan gene encoding resistance to kanamycin, into
Agrobacterium tumefaciens containing a helper Ti plasmid as described in
Hoeckema et al., Nature, 303:179-181 [1983] and culturing the Agrobacterium cells
with leaf slices, or other tissues or cells, of the plant to be transformed as described by
An et al., Plant Physiology, 81:301-305 [1986]. Transformation of cultured plant
host cells is normally accomplished through Agrobacterium tumefaciens. Cultures of
mammalian host cells and other host cells that do not have rigid cell membrane
barriers are usually transformed using the calcium phosphate method as originally
described by Graham and Van der Eb (Virology, 52:546 [1978]) and modified as
described in sections 16.32-16.37 of Sambrook et al., supra. However, other
methods for introducing DNA into cells such as Polybrene (Kawai and Nishizawa,
Mol. Cell. Biol., 4:1172 [1984]), protoplast fusion (Schaffner, Proc. Natl. Acad. Sci.
USA, 77:2163 [1980]), electroporation (Neumann et al., EMBO J., 1:841 [1982]),
and direct microinjection into nuclei (Capecchi, Cell, 22:479 [1980]) may also be
used. Additionally, animal transformation strategies are reviewed in Monastersky 
G.M. and Robl, J.M., Strategies in Transgenic Animal Science, ASM Press, 
Washington, DC, 1995. Transformed plant calli may be selected through the 
selectable marker by growing the cells on a medium containing, e.g., kanamycin, and 
appropriate amounts of phytohormone such as naphthalene acetic acid and
benzyladenine for callus and shoot induction. The plant cells may then be regenerated 
and the resulting plants transferred to soil using techniques well known to those 
skilled in the art. 

In addition, a gene regulating 1-deoxyxylulose-5-phosphate synthase
production can be incorporated into the plant along with a necessary promoter which
is inducible. In the practice of this embodiment of the invention, a promoter that only 
responds to a specific external or internal stimulus is fused to the target cDNA. Thus, 
the gene will not be transcribed except in response to the specific stimulus. As long 
as the gene is not being transcribed, its gene product is not produced. 

An illustrative example of a responsive promoter system that can be used in
the practice of this invention is the glutathione-S-transferase (GST) system in maize.
GSTs are a family of enzymes that can detoxify a number of hydrophobic electrophilic
compounds that often are used as pre-emergent herbicides (Weigand et al., Plant
Molecular Biology, 7:235-243 [1986]). Studies have shown that the GSTs are
directly involved in causing this enhanced herbicide tolerance. This action is primarily
mediated through a specific 1.1 kb mRNA transcription product. In short, maize has
a naturally occurring quiescent gene already present that can respond to external 
stimuli and that can be induced to produce a gene product. This gene has previously 
been identified and cloned. Thus, in one embodiment of this invention, the promoter
is removed from the GST responsive gene and attached to a 1-deoxyxylulose-5-
phosphate synthase gene that previously has had its native promoter removed. This
engineered gene is the combination of a promoter that responds to an external
chemical stimulus and a gene responsible for successful production of 1-
deoxyxylulose-5-phosphate synthase.

In addition to the methods described above, several methods are known in the
art for transferring cloned DNA into a wide variety of plant species, including 
gymnosperms, angiosperms, monocots and dicots (see, e.g., Glick and
Thompson, eds., Methods in Plant Molecular Biology, CRC Press, Boca Raton,
Florida [1993], incorporated by reference herein). Representative examples include
electroporation-facilitated DNA uptake by protoplasts in which an electrical pulse 
transiently permeabilizes cell membranes, permitting the uptake of a variety of 
biological molecules, including recombinant DNA (Rhodes et al., Science,
240(4849):204-207 [1988]); treatment of protoplasts with polyethylene glycol
(Lyznik et al., Plant Molecular Biology, 13:151-161 [1989]); and bombardment of
cells with DNA-laden microprojectiles which are propelled by explosive force or
compressed gas to penetrate the cell wall (Klein et al., Plant Physiol. 91:440-444
[1989] and Boynton et al., Science, 240(4858):1534-1538 [1988]). Transformation
of Taxus species can be achieved, for example, by employing the methods set forth in
Han et al., Plant Science, 95:187-196 (1994), incorporated by reference herein. A
method that has been applied to rye plants (Secale cereale) is to directly inject
plasmid DNA, including a selectable marker gene, into developing floral tillers (de la 
Pena et al., Nature 325:274-276 (1987)). Further, plant viruses can be used as 
vectors to transfer genes to plant cells. Examples of plant viruses that can be used as
vectors to transform plants include the Cauliflower Mosaic Virus (Brisson et al.,
Nature 310:511-514 (1984)). Additionally, plant transformation strategies and
techniques are reviewed in Birch, R.G., Ann Rev Plant Phys Plant Mol Biol, 48:297
(1997); Forester et al., Exp. Agric., 33:15-33 (1997). The aforementioned
publications disclosing plant transformation techniques are incorporated herein by
reference, and minor variations make these technologies applicable to a broad range
of plant species. 

Each of these techniques has advantages and disadvantages. In each of the 
techniques, DNA from a plasmid is genetically engineered such that it contains not 
only the gene of interest, but also selectable and screenable marker genes. A
selectable marker gene is used to select only those cells that have integrated copies of
the plasmid (the construction is such that the gene of interest and the selectable and 
screenable genes are transferred as a unit). The screenable gene provides another 
check for the successful culturing of only those cells carrying the genes of interest. A 
commonly used selectable marker gene is neomycin phosphotransferase II (NPT II).
This gene conveys resistance to kanamycin, a compound that can be added directly to
the growth media on which the cells grow. Plant cells are normally susceptible to
kanamycin and, as a result, die. The presence of the NPT II gene overcomes the 
effects of the kanamycin and each cell with this gene remains viable. Another 
selectable marker gene which can be employed in the practice of this invention is the 
gene which confers resistance to the herbicide glufosinate (Basta). A screenable gene
commonly used is the β-glucuronidase gene (GUS). The presence of this gene is
characterized using a histochemical reaction in which a sample of putatively 
transformed cells is treated with a GUS assay solution. After an appropriate 
incubation, the cells containing the GUS gene turn blue. 

The plasmid containing one or more of these genes is introduced into either
plant protoplasts or callus cells by any of the previously mentioned techniques. If the
marker gene is a selectable gene, only those cells that have incorporated the DNA 
package survive under selection with the appropriate phytotoxic agent. Once the 
appropriate cells are identified and propagated, plants are regenerated. Progeny from
the transformed plants must be tested to ensure that the DNA package has been
successfully integrated into the plant genome. 

Mammalian host cells may also be used in the practice of the invention. 
Examples of suitable mammalian cell lines include monkey kidney CV1 line
transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line 293S
(Graham et al., J. Gen. Virol., 36:59 [1977]); baby hamster kidney cells (BHK,
ATCC CCL 10); Chinese hamster ovary cells (Urlaub and Chasin, Proc. Natl. Acad.
Sci. USA, 77:4216 [1980]); mouse Sertoli cells (TM4, Mather, Biol. Reprod., 23:243
[1980]); monkey kidney cells (CV1-76, ATCC CCL 70); African green monkey
kidney cells (VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA,
ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells
(BRL 3A, ATCC CRL 1442); human lung cells (WI38, ATCC CCL 75); human liver
cells (Hep G2, HB 8065); mouse mammary tumor cells (MMT 060562,
ATCC CCL 51); rat hepatoma cells (HTC, MI. 54, Baumann et al., J. Cell Biol., 85:1
[1980]); and TRI cells (Mather et al., Annals N.Y. Acad. Sci., 383:44 [1982]).
Expression vectors for these cells ordinarily include (if necessary) DNA sequences for
an origin of replication, a promoter located in front of the gene to be expressed, a 
ribosome binding site, an RNA splice site, a polyadenylation site, and a transcription 
terminator site. 

Promoters used in mammalian expression vectors are often of viral origin. 
These viral promoters are commonly derived from polyoma virus, Adenovirus 2, and
most frequently Simian Virus 40 (SV40). The SV40 virus contains two promoters
that are termed the early and late promoters. These promoters are particularly useful
because they are both easily obtained from the virus as one DNA fragment that also
contains the viral origin of replication (Fiers et al., Nature, 273:113 [1978]). Smaller
or larger SV40 DNA fragments may also be used, provided they contain the
approximately 250-bp sequence extending from the HindIII site toward the BglI site
located in the viral origin of replication. 

Alternatively, promoters that are naturally associated with the foreign gene 
(homologous promoters) may be used provided that they are compatible with the host 
cell line selected for transformation. 

An origin of replication may be obtained from an exogenous source, such as 
SV40 or other virus (e.g., Polyoma, Adeno, VSV, BPV) and inserted into the cloning 
vector. Alternatively, the origin of replication may be provided by the host cell 
chromosomal replication mechanism. If the vector containing the foreign gene is 
integrated into the host cell chromosome, the latter is often sufficient. 

The use of a secondary DNA coding sequence can enhance production levels 
of 1-deoxyxylulose-5-phosphate synthase in transformed cell lines. The secondary
coding sequence typically comprises the enzyme dihydrofolate reductase (DHFR). 
The wild-type form of DHFR is normally inhibited by the chemical methotrexate 
(MTX). The level of DHFR expression in a cell will vary depending on the amount of 
MTX added to the cultured host cells. An additional feature of DHFR that makes it 
particularly useful as a secondary sequence is that it can be used as a selection marker 
to identify transformed cells. Two forms of DHFR are available for use as secondary 
sequences, wild-type DHFR and MTX-resistant DHFR. The type of DHFR used in a 
particular host cell depends on whether the host cell is DHFR deficient (such that it 
either produces very low levels of DHFR endogenously, or it does not produce 
functional DHFR at all). DHFR-deficient cell lines such as the CHO cell line 
described by Urlaub and Chasin, supra, are transformed with wild-type DHFR coding 
sequences. After transformation, these DHFR-deficient cell lines express functional 
DHFR and are capable of growing in a culture medium lacking the nutrients 
hypoxanthine, glycine and thymidine. Nontransformed cells will not survive in this 
medium. 

The MTX-resistant form of DHFR can be used as a means of selecting for 
transformed host cells in those host cells that endogenously produce normal amounts 
of functional DHFR that is MTX sensitive. The CHO-K1 cell line (ATCC No. CL 61)
possesses these characteristics, and is thus a useful cell line for this purpose. The
addition of MTX to the cell culture medium will permit only those cells transformed 
with the DNA encoding the MTX-resistant DHFR to grow. The nontransformed cells 
will be unable to survive in this medium. 

Prokaryotes may also be used as host cells for the initial cloning steps of this
invention. They are particularly useful for rapid production of large amounts of DNA,
for production of single-stranded DNA templates used for site-directed mutagenesis,
for screening many mutants simultaneously, and for DNA sequencing of the mutants
generated. Suitable prokaryotic host cells include E. coli K12 strain 294 (ATCC
No. 31,446), E. coli strain W3110 (ATCC No. 27,325), E. coli X1776 (ATCC
No. 31,537), and E. coli B; however many other strains of E. coli, such as HB101,
JM101, NM522, NM538, NM539, and many other species and genera of prokaryotes
including bacilli such as Bacillus subtilis, other enterobacteriaceae such as Salmonella
typhimurium or Serratia marcescens, and various Pseudomonas species may all be
used as hosts. Prokaryotic host cells or other host cells with rigid cell walls are
preferably transformed using the calcium chloride method as described in section 1.82 
of Sambrook et al., supra. Alternatively, electroporation may be used for 
transformation of these cells. Prokaryote transformation techniques are set forth in 
Dower, W.J., in Genetic Engineering, Principles and Methods, 12:275-296, Plenum
Publishing Corp., 1990; Hanahan et al., Meth. Enzymol., 204:63 (1991).

As a representative example, cDNA sequences encoding 1-deoxyxylulose-5-
phosphate synthase may be transferred to the (His)6•Tag pET vector commercially
available (from Novagen) for overexpression in E. coli as heterologous host. This
pET expression plasmid has several advantages in high level heterologous expression
systems. The desired cDNA insert is ligated in frame to plasmid vector sequences
encoding six histidines followed by a highly specific protease recognition site 
(thrombin) that are joined to the amino terminus codon of the target protein. The 
histidine "block" of the expressed fusion protein promotes very tight binding to 
immobilized metal ions and permits rapid purification of the recombinant protein by
immobilized metal ion affinity chromatography. The histidine leader sequence is then
cleaved at the specific proteolysis site by treatment of the purified protein with
thrombin, and the 1-deoxyxylulose-5-phosphate synthase again purified by
immobilized metal ion affinity chromatography, this time using a shallower imidazole 
gradient to elute the recombinant synthases while leaving the histidine block still
adsorbed. This overexpression-purification system has high capacity, excellent
resolving power and is fast, and the chance of a contaminating E. coli protein
exhibiting similar binding behavior (before and after thrombin proteolysis) is
extremely small. 

As will be apparent to those skilled in the art, any plasmid vectors containing 
replicon and control sequences that are derived from species compatible with the host 
cell may also be used in the practice of the invention. The vector usually has a 
replication site, marker genes that provide phenotypic selection in transformed cells, 
one or more promoters, and a polylinker region containing several restriction sites for 
insertion of foreign DNA. Plasmids typically used for transformation of E. coli 
include pBR322, pUC18, pUC19, pUC118, pUC119, and Bluescript M13, all of
which are described in sections 1.12-1.20 of Sambrook et al., supra. However, many 
other suitable vectors are available as well. These vectors contain genes coding for 
ampicillin and/or tetracycline resistance which enable cells transformed with these
vectors to grow in the presence of these antibiotics. 

The promoters most commonly used in prokaryotic vectors include the 
β-lactamase (penicillinase) and lactose promoter systems (Chang et al., Nature,
375:615 [1978]; Itakura et al., Science, 198:1056 [1977]; Goeddel et al., Nature,
281:544 [1979]) and a tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids
Res., 8:4057 [1980]; EPO Appl. Publ. No. 36,776), and the alkaline phosphatase
systems. While these are the most commonly used, other microbial promoters have
been utilized, and details concerning their nucleotide sequences have been published,
enabling a skilled worker to ligate them functionally into plasmid vectors (see
Siebenlist et al., Cell, 20:269 [1980]).

Many eukaryotic proteins normally secreted from the cell contain an 
endogenous secretion signal sequence as part of the amino acid sequence. Thus, 
proteins normally found in the cytoplasm can be targeted for secretion by linking a 
signal sequence to the protein. This is readily accomplished by ligating DNA 
encoding a signal sequence to the 5' end of the DNA encoding the protein and then 
expressing this fusion protein in an appropriate host cell. The DNA encoding the 
signal sequence may be obtained as a restriction fragment from any gene encoding a 
protein with a signal sequence. Thus, prokaryotic, yeast, and eukaryotic signal 
sequences may be used herein, depending on the type of host cell utilized to practice 
the invention. The DNA and amino acid sequences of the signal sequence
portion of several eukaryotic genes including, for example, human growth hormone,
proinsulin, and prealbumin are known (see Stryer, Biochemistry, W.H. Freeman and
Company, New York, NY, p. 769 [1988]), and can be used as signal sequences in
appropriate eukaryotic host cells. Yeast signal sequences, as for example acid
phosphatase (Arima et al., Nuc. Acids Res., 11:1657 [1983]), α-factor, alkaline
phosphatase and invertase may be used to direct secretion from yeast host cells.
Prokaryotic signal sequences from genes encoding, for example, LamB or OmpF
(Wong et al., Gene, 68:193 [1988]), MalE, PhoA, or beta-lactamase, as well as other
genes, may be used to target proteins from prokaryotic cells into the culture medium.
Trafficking sequences from plants, animals and microbes can be employed in
the practice of the invention to direct the 1-deoxyxylulose-5-phosphate synthase
proteins of the present invention to the cytoplasm, endoplasmic reticulum,
mitochondria or other cellular components, or to target the protein for export to the
medium. These considerations apply to the overexpression of 1-deoxyxylulose-5-phosphate
synthase, and to direction of expression within cells or intact organisms to
permit gene product function in any desired location.

Suitable vectors containing DNA encoding replication sequences, regulatory
sequences, phenotypic selection genes and the 1-deoxyxylulose-5-phosphate
synthase DNA of interest are constructed using standard recombinant DNA
procedures. Isolated plasmids and DNA fragments are cleaved, tailored, and ligated
together in a specific order to generate the desired vectors, as is well known in the art
(see, for example, Sambrook et al., supra).

As discussed above, 1-deoxyxylulose-5-phosphate synthase variants are
preferably produced by means of mutation(s) that are generated using the method of
site-specific mutagenesis. This method requires the synthesis and use of specific
oligonucleotides that encode both the sequence of the desired mutation and a
sufficient number of adjacent nucleotides to allow the oligonucleotide to stably
hybridize to the DNA template.
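
The layout of such an oligonucleotide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name mutagenic_oligo, the 0-based codon index, and the default flank of 12 nucleotides on each side are hypothetical choices, not values taken from this specification, and the flank needed for stable hybridization depends on the template and conditions.

```python
def mutagenic_oligo(template, codon_start, new_codon, flank=12):
    """Return an oligonucleotide carrying new_codon in place of the codon
    that starts at codon_start (0-based) on the template strand, with
    `flank` unchanged template nucleotides on each side.

    The flank length is an illustrative assumption, not a recommended value.
    """
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three nucleotides")
    left_start = codon_start - flank
    right_end = codon_start + 3 + flank
    if left_start < 0 or right_end > len(template):
        raise ValueError("template too short for the requested flanks")
    left = template[left_start:codon_start]        # unchanged 5' flank
    right = template[codon_start + 3:right_end]    # unchanged 3' flank
    return left + new_codon + right


# Toy template (not a sequence from this document): replace "ggg" by "gcc".
toy_template = "aaaaccccgggttttgggg"
oligo = mutagenic_oligo(toy_template, codon_start=8, new_codon="gcc", flank=4)
```

The flanks are copied unchanged from the template so that only the central codon differs, which is what lets the oligonucleotide anneal to the template despite the mismatch.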

The foregoing may be more fully understood in connection with the following
representative examples, in which "Plasmids" are designated by a lower case p
followed by an alphanumeric designation. The starting plasmids used in this invention
are either commercially available, publicly available on an unrestricted basis, or can be
constructed from such available plasmids using published procedures. In addition,
other equivalent plasmids are known in the art and will be apparent to the ordinary
artisan.

"Digestion", "cutting" or "cleaving" of DNA refers to catalytic cleavage of the 
DNA with an enzyme that acts only at particular locations in the DNA. These
enzymes are called restriction endonucleases, and the site along the DNA sequence
where each enzyme cleaves is called a restriction site. The restriction enzymes used in
this invention are commercially available and are used according to the instructions
supplied by the manufacturers. (See also sections 1.60-1.61 and sections 3.38-3.39 of
Sambrook et al., supra.)

"Recovery" or "isolation" of a given fragment of DNA from a restriction 
digest means separation of the resulting DNA fragment on a polyacrylamide or an
agarose gel by electrophoresis, identification of the fragment of interest by
comparison of its mobility versus that of marker DNA fragments of known molecular
weight, removal of the gel section containing the desired fragment, and separation of
the gel from DNA. This procedure is known generally. For example, see Lawn et al.
(Nucleic Acids Res., 9:6103-6114 [1982]), and Goeddel et al. (Nucleic Acids Res.,
supra).
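
The comparison of fragment mobility with that of marker fragments can be illustrated numerically. The sketch below assumes the common approximation that log10(fragment size) falls linearly with migration distance over a limited size range; that relation, the function names, and the marker values are illustrative assumptions and are not taken from this specification.

```python
import math


def fit_log_linear(markers):
    """Least-squares fit of log10(size) = intercept + slope * distance.

    markers: list of (size_in_bp, migration_distance) pairs for the ladder.
    """
    if len(markers) < 2:
        raise ValueError("at least two marker fragments are required")
    xs = [distance for _, distance in markers]
    ys = [math.log10(size) for size, _ in markers]
    n = len(markers)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("marker distances must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def estimate_size(markers, distance):
    """Estimate the size (bp) of a fragment from its migration distance."""
    intercept, slope = fit_log_linear(markers)
    return 10 ** (intercept + slope * distance)


# Hypothetical ladder: (size in bp, migration distance in mm).
MARKERS = [(10000, 10.0), (1000, 20.0), (100, 30.0)]
size_at_25_mm = estimate_size(MARKERS, 25.0)
```

Real gels deviate from this straight line for very large and very small fragments, so the estimate is only useful for bands that run between the markers used for the fit.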

These and other aspects of the present invention may be made more apparent
in connection with the following representative examples that are presented for
purposes of illustrating some of the inventive concepts.

EXAMPLE 1 
cDNA Library Construction and Screening 
cDNA Library Construction and Screening. Peppermint
(Mentha x piperita) leaf secretory cells are highly specialized for isoprenoid
(monoterpene essential oil) formation and are, thus, a highly enriched source of mRNA
species encoding proteins involved in essential oil biosynthesis. Consequently, total
RNA was extracted (Logemann, J. et al., Anal. Biochem. 163:16-20 (1987)) from
secretory cells which had been isolated from 5-day-old peppermint leaves
(Gershenzon, J. et al., Anal. Biochem. 200:130-138 (1992)). Poly(A)+ RNA was
purified by chromatography on oligo(dT)-cellulose (Pharmacia), and 5 µg of the
resulting mRNA was utilized to construct a λZAP cDNA library according to the
manufacturer's instructions (Stratagene). One hundred and fifty randomly picked and
purified clones were in vivo-excised and the resulting phagemids were sequenced
using T3 and T7 primers. In addition to several genes of known function in
isoprenoid biosynthesis, two clones (designated pDS1 and pDS2) were identified
which exhibited significant sequence similarity to a recently described Arabidopsis
thaliana transketolase gene (CLA1) of unknown function, the disruption of which
results in an albino mutant arrested in chloroplast development (Mandel, M.A. et al.,
Plant J. 9:649-658 (1996)). The nucleotide sequence of the cDNA insert of pDS1 is
set forth in SEQ ID NO:1, and the nucleotide sequence of the cDNA insert of pDS2 is
set forth in SEQ ID NO:2.

A set of 3,000 plaques was then screened with a nucleic acid probe (SEQ ID
NO:9) derived by PCR from the pDS1 cDNA insert (SEQ ID NO:1). This procedure
afforded 47 positive signals under high-stringency hybridization conditions. After one
additional cycle of hybridization, the positive clones were in vivo-excised, the insert
sizes were determined by PCR, and the 20 largest clones were partially sequenced.
Three of these clones (designated pDS16, pDS29 and pDS39) appeared to be
full-length and were entirely sequenced on both strands. The nucleotide sequence of the
cDNA insert of pDS16 is set forth in SEQ ID NO:3, the nucleotide sequence of the
cDNA insert of pDS29 is set forth in SEQ ID NO:5 and the nucleotide sequence of
the cDNA insert of pDS39 is set forth in SEQ ID NO:7.

Structure of cDNA Insert of DXPS Clone pDS29 (SEQ ID NO:5). The
cDNA insert of DXPS clone pDS29 (SEQ ID NO:5), which yielded the highest
expressed level of synthase activity, contains an open reading frame (ORF) of 2172
nucleotides. The first 70 deduced amino acid residues show the general
characteristics of plastidial targeting sequences (von Heijne, G. et al., Eur. J.
Biochem. 180:535-545 (1989)), consistent with the proposed subcellular location of
the enzyme in plant cells. By excluding the putative transit peptide residues, the
sequence corresponds to a mature protein of about 650 amino acids, with a predicted
size of roughly 71 kDa. This compares to a deduced protein of 620 residues with a
predicted size of 67.6 kDa described by Boronat and associates in a preliminary report
on a DXPS clone from E. coli (Lois, L.M. et al., Third Terpnet Meeting of the
European Network on Plant Isoprenoids, Poitiers, France, May 29-30 (1997)). An
alignment of translated transketolase sequences (devoid of plastid-targeting peptides
where appropriate) shows very high similarity/identity values between the peppermint
DXPS and CLA1 from Arabidopsis (Mandel, M.A. et al., Plant J. 9:649-658 (1996))
(85/77 %), ORF2814 (part of the puf operon in the photosynthetic gene cluster) from
the purple non-sulfur photosynthetic bacterium Rhodobacter capsulata (Youvan,
D.G. et al., Cell 37:949-957 (1984)) (72/56 %), ORF f620 (map position 9.43 min;
the presumptive DXPS of E. coli (Lois, L.M. et al., Third Terpnet Meeting of the
European Network on Plant Isoprenoids, Poitiers, France, May 29-30 (1997)))
(69/48 %), and a deduced protein from the cyanobacterium Synechocystis sp. strain
PCC6803 (Kaneko, T. et al., DNA Res. 3:109-136 (1996)) (65/45 %) (FIGURE 4).
These conserved sequences appear to form a new class of transketolases that is
distinct from the well-characterized transketolases involved in the pentose phosphate
pathway (FIGURE 4), and the extensive sequence similarity among these genes of
diverse origin suggests that they all encode DXPS or a very closely related synthase.
In addition, the general transketolase consensus TPP-binding motif
(GDG(X)7-8E(X)3-4A(X)11-13NN) (SEQ ID NO:10) determined by Hawkins et al.
(Hawkins, C.F. et al., FEBS Lett. 255:77-82 (1989)) was observed in this new
transketolase type as:

DG(A/S)(X)T(A/G)G(Q/M)AXEAXN(N/H)AG(X)7-8(I/V)(V/I)LNDN
(SEQ ID NO:11) (residues 219-250 of the peppermint sequence).
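
As an illustration only, the motif of this new transketolase type can be written as a regular expression and tested against a translated stretch of the pDS1 insert. The sketch below makes three assumptions. It reads the motif as DG(A/S)(X)T(A/G)G(Q/M)AXEAXN(N/H)AG(X)7-8(I/V)(V/I)LNDN, where the (I/V) position is reconstructed from a poorly printed character. It takes nucleotides 182-271 of SEQ ID NO:1 as printed in the sequence listing. It assumes the reading frame beginning at nucleotide 182. The names translate, FRAGMENT and NEW_TYPE_MOTIF are hypothetical.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a lowercase DNA string in frame 1, ignoring a partial
    codon at the 3' end."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 182-271 of SEQ ID NO:1 as printed (assumed reading frame).
FRAGMENT = (
    "gacggcgccatgacagctggacaagcgtacgaggccttaaacaacgccggattcctcga"
    "ttcgaatctcataatcgttttaaacgacaac"
)

# The new-type motif as a regular expression (X = any residue).
NEW_TYPE_MOTIF = re.compile(
    r"DG[AS].T[AG]G[QM]A.EA.N[NH]AG.{7,8}[IV][VI]LNDN"
)

peptide = translate(FRAGMENT)
match = NEW_TYPE_MOTIF.search(peptide)
```

With these assumptions the translated fragment matches the expression over its full length. A match of this kind only shows that the printed nucleotide sequence is compatible with the motif as read here; it is not a substitute for the alignment referred to in the text.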

EXAMPLE 2

cDNA Expression in E. coli and Product Identification
The cDNA inserts of clones pDS16 (SEQ ID NO:3), pDS29 (SEQ ID NO:5),
and pDS39 (SEQ ID NO:7) were evaluated by heterologous expression for an
enzyme capable of catalyzing the condensation reaction of pyruvate and GAP to a
deoxypentulose phosphate (FIGURE 1).

E. coli SOLR cells harboring pDS16, pDS29 or pDS39 were grown at 37°C
in 5 ml of LB medium supplemented with appropriate antibiotics to an OD600 of 0.7,
transferred to a new flask containing 50 ml of the same medium, and incubated at
20°C for 2 h. After induction with 200 µmol isopropyl-1-thio-β-D-galactopyranoside
(IPTG), the cells were maintained for another 14 h at 20°C. Bacteria were harvested
by centrifugation (1800 x g, 10 min), washed with 5 ml of assay buffer (100 mM
sodium phosphate (pH 6.5) containing 3 mM MgCl2, 0.1 mM EDTA, 5 mM NaF,
20 mM phenylmethanesulfonyl fluoride, and 100 µM thiamin diphosphate), and then
resuspended in 1 ml of assay buffer. Cells were disrupted by brief sonication at
0-4°C, and the resulting homogenate was centrifuged as above to pellet debris. An
aliquot (50 µl) of the supernatant was transferred to a 600 µl Eppendorf tube, to
which 30 µM [2-14C]pyruvate (18.5 kBq) and 0.4 µM D,L-glyceraldehyde-3-phosphate
(GAP) (or 0.4 µM D-GAP) were added, and the mixture was incubated at
23°C for 30 min. The reaction was terminated by addition of 70 µl acetone and
freezing at -20°C for 20 min. Following centrifugation (14,000 rpm, bench-top
centrifuge) to remove denatured protein, the supernatant was transferred to a new vial
and evaporated to dryness. The residue was dissolved in 40 µl H2O and analyzed by
reversed-phase (C18) ion-pair radio-HPLC using a procedure previously described
with minor modifications (McCaskill, D. & Croteau, R., Anal. Biochem. 215:142-149
(1993)).

Enzyme assays performed with extracts of IPTG-induced cells harboring
plasmid pDS29, including the cDNA sequence set forth in SEQ ID NO:5, or pDS39,
including the cDNA sequence set forth in SEQ ID NO:7, showed the GAP-dependent
appearance of a labeled product with an Rt of 35.5 min that was formed in significantly
higher amounts than in control assays (extracts from cells containing vector without
insert). The radio-labeled product was isolated by semi-preparative HPLC and
hydrolyzed with excess acid phosphatase, and the resulting sugar was lyophilized and
silylated (100 µl bis(trimethylsilyl)trifluoroacetamide, 10 µl pyridine and 100 µg
Na2SO4; 80°C for 1 h). GC-MS analysis (of the silylated biosynthetic product and of
silylated authentic 1-deoxy-D-xylulose) was performed using a Hewlett-Packard
5840A/5985B system equipped with a 30 m x 0.25 mm diameter fused silica column
coated with a 0.25 µm film of HP 5MS (Hewlett-Packard). The oven was
programmed from 90°C (2 min hold) at 20°C/min to 250°C (2 min hold), then at
20°C/min to 300°C at 10 psi He, and EI spectra were recorded at 70 eV with an
electron multiplier voltage of 2200 V. Full spectra were acquired and selected
diagnostic ions were monitored: m/z 307 [M+ - 43 (CH3CO)]; m/z 277 [M+ -
73 ((CH3)3Si)]; m/z 218 [M+ - 43 (CH3CO) - 89 ((CH3)3SiO)]; m/z 205
[M+ - 145 (CH3COCHOSi(CH3)3)]; m/z 204 [((CH3)3SiOCHCH2OSi(CH3)3)+];
m/z 147 [((CH3)2SiOSi(CH3)3)+]; m/z 132 [((CH3)3SiOCH2CHO)+]; m/z 117
[((CH3)3SiOCH2CH2)+]; m/z 103 [((CH3)3SiOCH2)+]; m/z 89 [((CH3)3SiO)+];
m/z 73 [((CH3)3Si)+]. The silylated derivative of the biosynthetic product eluted at an
Rt of 6.71 min; the silylated derivative of authentic 1-deoxy-D-xylulose eluted at an
Rt of 6.70 min.
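
The neutral-loss assignments quoted for the diagnostic ions can be checked with simple nominal-mass arithmetic. The sketch below assumes that all three hydroxyl groups of 1-deoxyxylulose (taken here as C5H10O4) carry a trimethylsilyl group, which gives a nominal molecular ion of m/z 350. That value is derived here and is not stated in the text, and all function and variable names are hypothetical.

```python
# Nominal (integer) masses of the most abundant isotopes.
NOMINAL_MASS = {"C": 12, "H": 1, "O": 16, "Si": 28}


def nominal_mass(formula):
    """Nominal mass of a formula given as {element: count}."""
    return sum(NOMINAL_MASS[element] * count for element, count in formula.items())


DEOXYXYLULOSE = {"C": 5, "H": 10, "O": 4}   # assumed free sugar, C5H10O4
TMS_NET_GAIN = {"C": 3, "H": 8, "Si": 1}    # Si(CH3)3 replacing one hydroxyl H


def silylated_molecular_ion(n_tms=3):
    """Nominal M+ for the sugar carrying n_tms trimethylsilyl groups
    (three is an assumption: one per hydroxyl group)."""
    return nominal_mass(DEOXYXYLULOSE) + n_tms * nominal_mass(TMS_NET_GAIN)


# Ions quoted in the text as M+ minus one or more neutral fragments.
QUOTED_LOSSES = {
    307: [43],        # - CH3CO
    277: [73],        # - (CH3)3Si
    218: [43, 89],    # - CH3CO - (CH3)3SiO
    205: [145],       # - CH3COCHOSi(CH3)3
}


def check_losses(molecular_ion):
    """Return {ion: True/False} for whether M+ minus the losses gives the ion."""
    return {
        ion: molecular_ion - sum(losses) == ion
        for ion, losses in QUOTED_LOSSES.items()
    }


molecular_ion = silylated_molecular_ion()
consistency = check_losses(molecular_ion)
```

Under the tris-trimethylsilyl assumption all four quoted losses lead back to the same molecular ion, which is the kind of internal consistency expected when the fragment assignments belong to a single compound.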

In accordance with the detailed procedures set forth in the preceding
paragraphs, E. coli cultures transfected with phagemids derived from pDS16,
including the cDNA sequence set forth in SEQ ID NO:3, pDS29, including the cDNA
sequence set forth in SEQ ID NO:5, and pDS39, including the cDNA sequence set
forth in SEQ ID NO:7, were each induced with isopropyl-1-thio-β-D-galactopyranoside
(IPTG), the corresponding bacterial cells were harvested and
homogenized, and the extracts were assayed using [2-14C]pyruvate and
D,L-glyceraldehyde-3-phosphate as co-substrates. Preparations from E. coli cells
transfected with pDS29, including the cDNA sequence set forth in SEQ ID NO:5, and
pDS39, including the cDNA sequence set forth in SEQ ID NO:7, yielded a prominent
new radioactive component in the reaction mixture that upon reversed-phase ion-pair
radio-HPLC exhibited an Rt (35.5 min) consistent with that of a sugar (pentulose)
phosphate (McCaskill, D. & Croteau, R., Anal. Biochem. 215:142-149 (1993)). The
same enzymatic product was generated with D-glyceraldehyde-3-phosphate as
co-substrate, indicating that the D-antipode is the likely natural substrate of the
functionally expressed transketolase. The presumptive pentulose phosphate product
from preparative enzyme incubations (E. coli transformed with pDS29, including the
cDNA sequence set forth in SEQ ID NO:5) was purified by HPLC and hydrolyzed
with acid phosphatase, and the resulting sugar was silylated. This derivatized product
of the recombinant enzyme was then analyzed by combined capillary GC-MS and
shown to possess the identical retention time (6.71 ± 0.03 min) and mass spectrum as
that of an authentic sample of silylated 1-deoxy-D-xylulose (FIGURES 2A and 2B).
The combined evidence thus indicated that a cDNA encoding 1-deoxyxylulose-5-phosphate
synthase (DXPS) had been acquired. DXPS activity was significantly
higher in the IPTG-induced E. coli cells expressing pDS29, including the cDNA
sequence set forth in SEQ ID NO:5, when compared to identically treated cells
containing the same plasmid devoid of cDNA insert (SEQ ID NO:5) (7-fold higher
than endogenous activity, n=7, p<0.01).

EXAMPLE 3

RNA Blot Analysis and Determination of Monoterpene Biosynthetic Rate

Peppermint oil gland secretory cell RNA was isolated from leaves of different
ages, separated on a 1.5 % formaldehyde-agarose gel (5 µg each lane), and blotted
onto Nylon membranes. DXPS mRNA was detected with a 32P-labeled probe (SEQ
ID NO:12) prepared from the cDNA insert of clone pDS29 (SEQ ID NO:5).
Administration of 14CO2 to peppermint plants and the isolation and quantification of
the leaf monoterpenes produced were performed as described previously
(Gershenzon, J. et al., Oecologia 96:583-592 (1993)).

RNA blot analyses showed a direct correlation between steady-state levels of
the DXPS message and monoterpene production, as determined by 14CO2
incorporation (Gershenzon, J. et al., Anal. Biochem. 200:130-138 (1992)), thus suggesting
activation of the non-mevalonate pathway to supply the IPP precursor for subsequent
monoterpene biosynthesis in peppermint oil glands (FIGURE 3).

The cloning, characterization and expression of DXPS from peppermint
provide direct evidence for the operation of the mevalonate-independent pathway in
plants, where, in parallel with the classical cytosolic mevalonate pathway for sterol
biosynthesis (Lichtenthaler, H.K. et al., FEBS Lett. 400:271-274 (1997)), this
plastidial pyruvate/GAP pathway functions to synthesize a very broad range of
isoprenoids (Lichtenthaler, H.K. et al., FEBS Lett. 400:271-274 (1997)); (Zeidler, J.G.
et al., Z. Naturforsch. 52c:15-23 (1997)); (Eisenreich, W. et al., Tetrahedron Lett.
38:3889-3892 (1997)); (Eisenreich, et al., Proc. Natl. Acad. Sci. USA
93:6431-6436 (1996)); and (Schwarz, M.K., PhD thesis, ETH, Zurich, Switzerland
(1994)).

The novel transketolases of the present invention are highly conserved
between bacteria and plants, but absent in animals, which rely entirely on the classical
mevalonate pathway for isoprenoid biosynthesis. The novel transketolases of the
present invention are targeted to the plastids of plant cells, in which the new pathway
operates, suggesting that plants have maintained the "bacterial" isoprenoid
biosynthetic pathway from the prokaryotic endosymbiont that gave rise to this
eukaryotic organelle.

The new pathway, termed the pyruvate/glyceraldehyde-3-phosphate pathway,
for which the initial steps have only recently been proposed, has been overlooked in
the past in spite of the fact that, in plants and certain bacteria, it is quantitatively and
functionally of greater significance than the classical mevalonate pathway. Thus, all
plastid-derived isoprenoids of plants are formed by this route, which yields a wide
variety of structures with numerous functions in growth, development and defense.

EXAMPLE 4

Hybridization Conditions

RNA samples from the following plant species were separated on a 1.5%
agarose gel containing 6% formaldehyde: Pinus taeda (1 µg messenger RNA isolated
from wounded stem); Pinus taeda (20 µg total RNA isolated from wounded stem);
Pseudotsuga menziesii (2 µg total RNA isolated from wounded stem); Abies grandis
(20 µg total RNA isolated from wounded stem); Taxus canadensis (20 µg total RNA
isolated from needles); Taxus cuspidata (20 µg total RNA isolated from needles);
Oryza sativa (20 µg total RNA isolated from 20 day old developing seeds); Triticum
aestivum (20 µg total RNA isolated from leaves); Lycopersicon esculentum (20 µg
total RNA isolated from leaves); Hyoscyamus niger (20 µg total RNA isolated from
leaves); Citrus limon (20 µg total RNA isolated from leaves); Mentha spicata (10 µg
total RNA isolated from glandular trichomes); Salvia officinalis (20 µg total RNA
isolated from leaves); Coleus blumei (20 µg total RNA isolated from leaves).

The gel containing the separated RNA samples was blotted onto Hybond
Nylon membrane (Amersham) and was prehybridized for one hour at 42°C. The
nucleic acid sequence set forth in SEQ ID NO:5 was used as a template for generating
a 32P-labelled hybridization probe. Hybridization was carried out at 42°C for ten
hours. The composition of the hybridization and prehybridization buffer was: 30%
formamide, 5 X Denhardt's reagent, 0.1% sodium dodecyl sulfate, 5 X SSPE. The
composition of 1 liter of a 50 X stock solution of Denhardt's reagent is: 5 g Ficoll, 5 g
polyvinylpyrrolidone and 5 g bovine serum albumin. The composition of 1 liter of a 10
X stock solution of SSPE is: 87.7 g sodium chloride, 13.8 g NaH2PO4·H2O, 3.7 g EDTA
at pH 7.4. The blot was washed in 6 X SSC for ten minutes at 42°C.
Autoradiography revealed that the hybridization probe recognized the corresponding
mRNA species encoding a 1-deoxyxylulose-5-phosphate synthase from each sample
of plant mRNA.
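
The buffer composition above translates into stock volumes by simple dilution arithmetic (final concentration x final volume / stock concentration). The sketch below is illustrative only: it assumes undiluted (100%) formamide and a 10% SDS stock, neither of which is specified in the text, and the function names are hypothetical.

```python
def stock_volume_ml(final_conc, stock_conc, final_volume_ml):
    """Volume of stock needed so that final_conc is reached in final_volume_ml.
    Both concentrations must be in the same units (%, or "X")."""
    return final_conc * final_volume_ml / stock_conc


def hybridization_buffer(final_volume_ml=100.0, sds_stock_percent=10.0):
    """Stock volumes (ml) for 30% formamide, 5 X Denhardt's reagent,
    0.1% SDS and 5 X SSPE, made up to volume with water.

    Assumptions: 100% formamide and a 10% SDS stock (not given in the text).
    """
    volumes = {
        "formamide (100%)": stock_volume_ml(30.0, 100.0, final_volume_ml),
        "50 X Denhardt's reagent": stock_volume_ml(5.0, 50.0, final_volume_ml),
        "10 X SSPE": stock_volume_ml(5.0, 10.0, final_volume_ml),
        "SDS stock": stock_volume_ml(0.1, sds_stock_percent, final_volume_ml),
    }
    volumes["water"] = final_volume_ml - sum(volumes.values())
    return volumes


# Grams per liter of 10 X SSPE as stated in the text.
SSPE_10X_G_PER_L = {"NaCl": 87.7, "NaH2PO4.H2O": 13.8, "EDTA": 3.7}


def sspe_10x_grams(volume_l):
    """Scale the stated 10 X SSPE recipe to another volume."""
    return {salt: grams * volume_l for salt, grams in SSPE_10X_G_PER_L.items()}


recipe = hybridization_buffer(100.0)
```

For 100 ml of buffer this gives 30 ml formamide, 10 ml 50 X Denhardt's reagent, 50 ml 10 X SSPE and 1 ml of the assumed 10% SDS stock, leaving 9 ml of water.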

High stringency wash conditions under which the foregoing probe derived from
the nucleic acid sequence set forth in SEQ ID NO:5 will remain hybridized to
Northern blotted mRNA species encoding 1-deoxyxylulose-5-phosphate synthase
proteins of the present invention, or to Southern blotted DNA species encoding
1-deoxyxylulose-5-phosphate synthase proteins of the present invention, are: two
fifteen-minute washes in 2 X SSC at room temperature (18°C to 25°C), followed by
two twenty-minute washes in 0.2 X SSC at 65°C.

While the preferred embodiment of the invention has been illustrated and
described, it will be appreciated that various changes can be made therein without
departing from the spirit and scope of the invention.

The embodiments of the invention in which an exclusive property or privilege
is claimed are defined as follows:

1. An isolated nucleic acid molecule encoding at least a functional portion
of a 1-deoxyxylulose-5-phosphate synthase from an essential oil plant.

2. An isolated nucleic acid molecule of Claim 1 encoding at least a
functional portion of a 1-deoxyxylulose-5-phosphate synthase from a species from the
family Lamiaceae.

3. An isolated nucleic acid molecule of Claim 2 encoding at least a
functional portion of a 1-deoxyxylulose-5-phosphate synthase from a species from the
genus Mentha.

4. An isolated nucleic acid molecule of Claim 3 encoding at least a
functional portion of a Mentha piperita 1-deoxyxylulose-5-phosphate synthase.

5. An isolated nucleic acid molecule of Claim 1 having a nucleotide
sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NO:5 and
SEQ ID NO:7.

6. An isolated nucleic acid molecule of Claim 1 which encodes an amino
acid sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:6
and SEQ ID NO:8.

7. An isolated nucleic acid molecule, encoding at least a portion of a
1-deoxyxylulose-5-phosphate synthase protein, from an essential oil plant, that
hybridizes to the complementary strand of the nucleic acid sequence set forth in SEQ
ID NO:3 under the Northern blot hybridization conditions set forth in Example 4, said
isolated nucleic acid molecule being capable of remaining hybridized to the
complementary strand of the nucleic acid sequence set forth in SEQ ID NO:3 under
the stringent Northern blot wash conditions set forth in Example 4.

8. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein from a plant species.

9. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein being from an essential oil plant species.

10. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein being from the genus Mentha.

11. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein being from Mentha piperita.

12. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein having the amino acid sequence of a protein selected
from the group consisting of SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8.

13. A replicable expression vector comprising a nucleic acid sequence of
any one of Claim 1, Claim 2, Claim 3, Claim 4, Claim 5, Claim 6 and Claim 7.

14. A host cell comprising a vector of Claim 13.

15. A host cell of Claim 14 wherein said host cell is a plant cell.

16. A method of altering 1-deoxyxylulose-5-phosphate synthase activity in
a suitable host cell comprising introducing into said host cell an expression vector of
Claim 13 under conditions enabling expression of a 1-deoxyxylulose-5-phosphate
synthase protein, or a functional portion thereof, encoded by said vector.

17. The method of Claim 16 wherein said host cell is a plant cell.

18. A method of reducing 1-deoxyxylulose-5-phosphate synthase activity
in a suitable host cell comprising introducing into said host cell a vector comprising a
nucleic acid sequence of any one of Claim 1, Claim 2, Claim 3, Claim 4, Claim 5,
Claim 6 and Claim 7, said nucleic acid sequence being in antisense orientation relative
to a promoter sequence that controls the transcription of said nucleic acid sequence.

AMENDED CLAIMS

[received by the International Bureau on 9 February 1999 (09.02.99);
original claims 1-4, 7 and 16 amended; remaining claims unchanged (2 pages)]

The embodiments of the invention in which an exclusive property or privilege
is claimed are defined as follows:

1. An isolated nucleic acid molecule encoding a 1-deoxyxylulose-5-phosphate
synthase from an essential oil plant.

2. An isolated nucleic acid molecule of Claim 1 encoding a
1-deoxyxylulose-5-phosphate synthase from a species from the family Lamiaceae.

3. An isolated nucleic acid molecule of Claim 2 encoding a
1-deoxyxylulose-5-phosphate synthase from a species from the genus Mentha.

4. An isolated nucleic acid molecule of Claim 3 encoding a Mentha
piperita 1-deoxyxylulose-5-phosphate synthase.

5. An isolated nucleic acid molecule of Claim 1 having a nucleotide
sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NO:5 and
SEQ ID NO:7.

6. An isolated nucleic acid molecule of Claim 1 which encodes an amino
acid sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:6
and SEQ ID NO:8.

7. An isolated nucleic acid molecule, encoding a 1-deoxyxylulose-5-phosphate
synthase protein, from an essential oil plant, that hybridizes to the
complementary strand of the nucleic acid sequence set forth in SEQ ID NO:3 under
the Northern blot hybridization conditions set forth in Example 4, said isolated
nucleic acid molecule being capable of remaining hybridized to the complementary
strand of the nucleic acid sequence set forth in SEQ ID NO:3 under the stringent
Northern blot wash conditions set forth in Example 4.

8. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein from a plant species.

9. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein being from an essential oil plant species.

10. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein being from the genus Mentha.

11. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein being from Mentha piperita.

12. An isolated, recombinant 1-deoxyxylulose-5-phosphate synthase
protein of Claim 8, said protein having the amino acid sequence of a protein selected
from the group consisting of SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8.

13. A replicable expression vector comprising a nucleic acid sequence of
any one of Claim 1, Claim 2, Claim 3, Claim 4, Claim 5, Claim 6 and Claim 7.

14. A host cell comprising a vector of Claim 13.

15. A host cell of Claim 14 wherein said host cell is a plant cell.

16. A method of altering 1-deoxyxylulose-5-phosphate synthase activity
in a suitable host cell comprising introducing into said host cell an expression vector
of Claim 13 under conditions enabling expression of a 1-deoxyxylulose-5-phosphate
synthase protein encoded by said vector.

17. The method of Claim 16 wherein said host cell is a plant cell.

18. A method of reducing 1-deoxyxylulose-5-phosphate synthase activity
in a suitable host cell comprising introducing into said host cell a vector comprising a
nucleic acid sequence of any one of Claim 1, Claim 2, Claim 3, Claim 4, Claim 5,
Claim 6 and Claim 7, said nucleic acid sequence being in antisense orientation
relative to a promoter sequence that controls the transcription of said nucleic acid
sequence.



AMENDED SHEET (ARTICLE 19) 

WO 99/11757 PCT/US98/18231

[FIGURE 1 (sheet 1/4): reaction scheme. Pyruvate, as its thiamin diphosphate (TPP) adduct, condenses with glyceraldehyde-3-phosphate to give 1-deoxyxylulose-5-phosphate, which leads to isopentenyl diphosphate.]



[FIGURES 2A and 2B (sheet 2/4): two mass spectra, relative abundance versus m/z, each with labeled ions at m/z 73, 89, 103, 117, 132, 147, 204, 205, 218, 277 and 307.]



[FIGURE 3 (sheet 3/4): x-axis, leaf age (d).]



[FIGURE 4 (sheet 4/4): no labels recoverable from the scan.]




SEQUENCE LISTING 

<110> Croteau, Rodney B 
Lange, Bernd M 
Wildung, Mark R 
McCaskill, David G 

<120> Nucleic and Amino Acid Sequences Relating to a Novel 
Transketolase, and Methods for the Expression Thereof 

<130> Novel trans ketolase from peppermint 

<140> 
<141> 

<150> 60/056,033 
<151> 1997-09-02 

<160> 12 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 1949 
<212> DNA 

<213> Mentha piperita 
<400> 1 



aagagcgaga atgcacacaa ttaggcagac atttggatta gcaggtttcc ctaagagaga 60

tgaaagtgct cacgatgcct tcggcgccgg ccatagttct accagtattt ctgctggttt 120

agggatggcg gtggcgagag atttactgca gaagaacaac cacgtcatat cggtgatcgg 180

cgacggcgcc atgacagctg gacaagcgta cgaggcctta aacaacgccg gattcctcga 240

ttcgaatctc ataatcgttt taaacgacaa caagcaggtg tctctaccca cggccaccgt 300

cgacggccct gcgccgccgg tcggagctct gagcaaagcc ctgaccaagc tgcaagccag 360

cagaaaattc cggcagctcc gcgaagcagc aaagagtatg actaagcaga tgggagcacc 420

ggcacatgaa atagcttcga agttgacaca atacgtgaaa gggatgatgg ggaaaccagg 480

cgcttcactt ttcgaagaac tggggattta ttacatcgga ccagtcgacg gccataacgt 540

tgaagatctt gtttatattt tcaagaaagt taaggaaatg cctgcgcctg ggcctgttct 600

tattcatatc atcaccgaaa aaggcaaagg ctacccccct gcagaaattg ctgccgacaa 660

aatgcatggg gtggtgaagt ttgatgcgaa aactgggaaa cagatgaaga cgaagaacaa 720

gacgaagtca tacacccagt acttcgccga gtctctggtg gcggaggcgg agcacgacga 780

caagatcgtg gcgatccacg ccgccatggg gggcggcacc gggctcaaca tcttccagaa 840

gcagttcccg gaccggtgct tcgacgtcgg gatcgcggag cagcacgcgg tgacgttcgc 900

cgccggtatg gcggcggagg ggctgaagcc tttctgcgcc atctactcct ccttcctcca 960

gaggggctac gaccaggtgg tccacgacgt cgacctccag aagctcccgg tccggttcat 1020

gatggatcgg gcaggagtcg tcggcgccga cggccccacc cactgcggcg ccttcgacac 1080






cacctacatg gcctgcctcc ccaacatggt ggtcatggct ccctccgacg aagcggagct 1140

catgaacatg atcgccaccg ccgccatcat cgacgaccga cctagctgcg tccggtaccc 1200

tagagggaac ggcatcggcg tcgctcttcc gtcgaacaac aaaggaactc cattagagat 1260

tggtaaggga agaatcttga aggaggggag caaagttgcg attctgggat tcggaaccat 1320

agtgcagaac tgcatggcgg cggcgaatct tctcgaacaa cacggaatct cagtaacagt 1380

agccgatgca agattctgca agccactcga tggggatttg ataaagaaac tggtgcagga 1440

gcatgaagta ctcatcactg ttgaagaagg atccatcggt ggattcagtg ctcacatttc 1500

tcatttcttg tccctcaatg gcttgctcga tggaaacctc aagtggaggc caatggttct 1560

tccagatagg tacattgatc atggagcaca gagtgatcaa atagaagaag cagggctgag 1620

tcctaagcat attgcaggga ctgttgtttc attgattgga ggaggaaagg acagtcttca 1680

tttgattaat aatttgtaat attattttaa tttatttctt cgaaaaggaa aagagaaaaa 1740

aatggagtct gaatttgagc agctgcaaaa attctccatg agagattagt gttaagatgt 1800

ataatgtaaa tatgggggaa ggtcaagact cttgacccat ggaaattggg ggagctgttc 1860

taaataattg ttgtgatggc agccttttct acatgtttta ttcaataaaa tcatttgtta 1920

cattttaaaa aaaaaaaaaa aaaaaaaaa 1949


<210> 2 
<211> 216 
<212> DNA 

<213> Mentha piperita 












<400> 2 
gtcaactccc agctggttgg gagaaggctc ttcctacata cacccctgag agcccagccg 60

atgccacaag gaacctctct cagcaaaacc tcaacgctct agcaaaagtt ctcccaggtc 120

tgctaggagg cagcgctgac cttgcctcct ccaacatgac ccttctcaaa acattcggcg 180

acttccaaag tagcactccc gaagaacgaa atgtaa 216


<210> 3 
<211> 2500 
<212> DNA 

<213> Mentha piperita 












<220> 
<221> CDS 
<222> (85)..(2247) 












<400> 3 
ctctctcaac acctctttcc tttcttcttc actagctact taatttagtt caagaaagaa 60



gagagagaaa gaagagagga agag atg gca tct tcc tgt gga gtt atc aag 111
Met Ala Ser Ser Cys Gly Val Ile Lys
1 5

agc agt ttc ttg cca tcg ctc cat tct gag gat tct acc ttc tta tca 159
Ser Ser Phe Leu Pro Ser Leu His Ser Glu Asp Ser Thr Phe Leu Ser
10 15 20 25

cgt gct cct act tct ctt ccc ctc aaa aat cat aag tta aat gtg gta 207
Arg Ala Pro Thr Ser Leu Pro Leu Lys Asn His Lys Leu Asn Val Val
30 35 40

gca gct ctc caa caa gat agt tcg aac gac gtc gtt cct agc gga gac 255
Ala Ala Leu Gln Gln Asp Ser Ser Asn Asp Val Val Pro Ser Gly Asp
45 50 55

agg ctg agc cgg ccg aaa tca aga gca ctg agt ttc acc gga gag aag 303
Arg Leu Ser Arg Pro Lys Ser Arg Ala Leu Ser Phe Thr Gly Glu Lys
60 65 70

cct ccc att cct ata ctg gac acc atc aac tac cct aat cac atg aaa 351
Pro Pro Ile Pro Ile Leu Asp Thr Ile Asn Tyr Pro Asn His Met Lys
75 80 85

aat ctt tcc gtc gag gaa ctc gca aac cta gct gat gaa ctg agg gaa 399
Asn Leu Ser Val Glu Glu Leu Ala Asn Leu Ala Asp Glu Leu Arg Glu
90 95 100 105

gag ata gtg tac acg gtg tcg aaa acc ggc ggc cat ctt agc tcg agc 447
Glu Ile Val Tyr Thr Val Ser Lys Thr Gly Gly His Leu Ser Ser Ser
110 115 120

cta ggc gtg tcg gag ctc acc gtc gca ctt cat cac gtt ttc aac acg 495
Leu Gly Val Ser Glu Leu Thr Val Ala Leu His His Val Phe Asn Thr
125 130 135

ccc gat gac aaa atc atc tgg gac gtc ggc cac cag gct tac cca cac 543
Pro Asp Asp Lys Ile Ile Trp Asp Val Gly His Gln Ala Tyr Pro His
140 145 150

aaa atc ttg acc ggg aga aga gcg aga atg cac aca att agg cag aca 591
Lys Ile Leu Thr Gly Arg Arg Ala Arg Met His Thr Ile Arg Gln Thr
155 160 165

ttt gga tta gca ggt ttc cct aag aga gat gaa agt gct cac gat gcc 639
Phe Gly Leu Ala Gly Phe Pro Lys Arg Asp Glu Ser Ala His Asp Ala
170 175 180 185

ttc ggc gcc ggc cat agt tct acc agt att tct gct ggt tta ggg atg 687
Phe Gly Ala Gly His Ser Ser Thr Ser Ile Ser Ala Gly Leu Gly Met
190 195 200

gcg gtg gcg aga gat tta ctg cag aag aac aac cac gtc ata tcg gtg 735
Ala Val Ala Arg Asp Leu Leu Gln Lys Asn Asn His Val Ile Ser Val
205 210 215

atc ggc gac ggc gcc atg aca gct gga caa gcg tac gag gcc tta aac 783
Ile Gly Asp Gly Ala Met Thr Ala Gly Gln Ala Tyr Glu Ala Leu Asn
220 225 230

aac gcc gga ttc ctc gat tcg aat ctc ata atc gtt tta aac gac aac 831
Asn Ala Gly Phe Leu Asp Ser Asn Leu Ile Ile Val Leu Asn Asp Asn
235 240 245

aag cag gtg tct cta ccc acg gcc acc gtc gac ggc cct gcg ccg ccg 879
Lys Gln Val Ser Leu Pro Thr Ala Thr Val Asp Gly Pro Ala Pro Pro
250 255 260 265

gtc gga gct ctg agc aaa gcc ctg acc aag ctg caa gcc agc aga aaa 927
Val Gly Ala Leu Ser Lys Ala Leu Thr Lys Leu Gln Ala Ser Arg Lys
270 275 280

ttc cgg cag ctc cgc gaa gca gca aag agt atg act aag cag atg gga 975
Phe Arg Gln Leu Arg Glu Ala Ala Lys Ser Met Thr Lys Gln Met Gly
285 290 295

gca ccg gca cat gaa ata gct tcg aag ttg aca caa tac gtg aaa ggg 1023
Ala Pro Ala His Glu Ile Ala Ser Lys Leu Thr Gln Tyr Val Lys Gly
300 305 310

atg atg ggg aaa cca ggc gct tca ctt ttc gaa gaa ctg ggg att tat 1071
Met Met Gly Lys Pro Gly Ala Ser Leu Phe Glu Glu Leu Gly Ile Tyr
315 320 325

tac atc gga cca gtc gac gtt gaa gat ctt gtt tat att ttc aag aaa 1119
Tyr Ile Gly Pro Val Asp Val Glu Asp Leu Val Tyr Ile Phe Lys Lys
330 335 340 345

gtt aag gaa atg cct gcg cct ggg cct gtt ctt att cat atc atc acc 1167
Val Lys Glu Met Pro Ala Pro Gly Pro Val Leu Ile His Ile Ile Thr
350 355 360

gaa aaa ggc aaa ggc tac ccc cct gca gaa att gct gcc gac aaa atg 1215
Glu Lys Gly Lys Gly Tyr Pro Pro Ala Glu Ile Ala Ala Asp Lys Met
365 370 375

cat ggg gtg gtg aag ttt gat gcg aaa act ggg aaa cag atg aag acg 1263
His Gly Val Val Lys Phe Asp Ala Lys Thr Gly Lys Gln Met Lys Thr
380 385 390

aag aac aag acg aag tca tac acc cag tac ttc gcc gag tct ctg gtg 1311
Lys Asn Lys Thr Lys Ser Tyr Thr Gln Tyr Phe Ala Glu Ser Leu Val
395 400 405

gcg gag gcg gag cac gac gac aag atc gtg gcg atc cac gcc gcc atg 1359
Ala Glu Ala Glu His Asp Asp Lys Ile Val Ala Ile His Ala Ala Met
410 415 420 425

ggg ggc ggc acc ggg ctc aac atc ttc cag aag cag ttc ccg gac cgg 1407
Gly Gly Gly Thr Gly Leu Asn Ile Phe Gln Lys Gln Phe Pro Asp Arg
430 435 440

tgc ttc gac gtc ggg atc gcg gag cag cac gcg gtg acg ttc gcc gcc 1455
Cys Phe Asp Val Gly Ile Ala Glu Gln His Ala Val Thr Phe Ala Ala
445 450 455

ggt atg gcg gcg gag ggg ctg aag cct ttc tgc gcc atc tac tcc tcc 1503
Gly Met Ala Ala Glu Gly Leu Lys Pro Phe Cys Ala Ile Tyr Ser Ser
460 465 470

ttc ctc cag agg ggc tac gac cag gtg gtc cac gac gtc gac ctc cag 1551
Phe Leu Gln Arg Gly Tyr Asp Gln Val Val His Asp Val Asp Leu Gln
475 480 485

aag ctc ccg gtc cgg ttc atg atg gat cgg gca gga gtc gtc ggc gcc 1599
Lys Leu Pro Val Arg Phe Met Met Asp Arg Ala Gly Val Val Gly Ala
490 495 500 505

gac ggc ccc acc cac tgc ggc gcc ttc gac acc acc tac atg gcc tgc 1647
Asp Gly Pro Thr His Cys Gly Ala Phe Asp Thr Thr Tyr Met Ala Cys
510 515 520

ctc ccc aac atg gtg gtc atg gct ccc tcc gac gaa gcg gag ctc atg 1695
Leu Pro Asn Met Val Val Met Ala Pro Ser Asp Glu Ala Glu Leu Met
525 530 535

aac atg atc gcc acc gcc gcc atc atc gac gac cga cct agc tgc gtc 1743
Asn Met Ile Ala Thr Ala Ala Ile Ile Asp Asp Arg Pro Ser Cys Val
540 545 550

cgg tac cct aga ggg aac ggc atc ggc gtc gct ctt ccg tcg aac aac 1791
Arg Tyr Pro Arg Gly Asn Gly Ile Gly Val Ala Leu Pro Ser Asn Asn
555 560 565

aaa gga act cca tta gag att ggt aag gga aga atc ttg aag gag ggg 1839
Lys Gly Thr Pro Leu Glu Ile Gly Lys Gly Arg Ile Leu Lys Glu Gly
570 575 580 585

agc aaa gtt gcg att ctg gga ttc gga acc ata gtg cag aac tgc atg 1887
Ser Lys Val Ala Ile Leu Gly Phe Gly Thr Ile Val Gln Asn Cys Met
590 595 600

gcg gcg gcg aat ctt ctc gaa caa cac gga atc tca gta aca gta gcc 1935
Ala Ala Ala Asn Leu Leu Glu Gln His Gly Ile Ser Val Thr Val Ala
605 610 615

gat gca aga ttc tgc aag cca ctc gat ggg gat ttg ata aag aaa ctg 1983
Asp Ala Arg Phe Cys Lys Pro Leu Asp Gly Asp Leu Ile Lys Lys Leu
620 625 630

gtg cag gag cat gaa gta ctc atc act gtt gaa gaa gga tcc atc ggt 2031
Val Gln Glu His Glu Val Leu Ile Thr Val Glu Glu Gly Ser Ile Gly
635 640 645

gga ttc agt gct cac att tct cat ttc ttg tcc ctc aat ggc ttg ctc 2079
Gly Phe Ser Ala His Ile Ser His Phe Leu Ser Leu Asn Gly Leu Leu
650 655 660 665

gat gga aac ctc aag tgg agg cca atg gtt ctt cca gat agg tac att 2127
Asp Gly Asn Leu Lys Trp Arg Pro Met Val Leu Pro Asp Arg Tyr Ile
670 675 680

gat cat gga gca cag agt gat caa ata gaa gaa gca ggg ctg agt cct 2175
Asp His Gly Ala Gln Ser Asp Gln Ile Glu Glu Ala Gly Leu Ser Pro
685 690 695

aag cat att gca ggg act gtt gtt tca ttg att gga gga gga aag gac 2223
Lys His Ile Ala Gly Thr Val Val Ser Leu Ile Gly Gly Gly Lys Asp
700 705 710

agt ctt cat ttg att aat aat ttg taatattatt ttaatttatt tcttcgaaaa 2277
Ser Leu His Leu Ile Asn Asn Leu
715 720

ggaaaagaga aaaaaatgga gtctgaattt gagcagctgc aaaaattctc catgagagat 2337

tagtgttaag atgtataatg taaatatggg ggaaggtcaa gactcttgac ccatggaaat 2397

tgggggagct gttctaaata attgttgtga tggcagcctt ttctacatgt tttattcaat 2457

aaaatcattt gttacatttt aaaaaaaaaa aaaaaaaaaa aaa 2500

<210> 4 
<211> 721 
<212> PRT 

<213> Mentha piperita 
<400> 4 

Met Ala Ser Ser Cys Gly Val Ile Lys Ser Ser Phe Leu Pro Ser Leu
1 5 10 15

His Ser Glu Asp Ser Thr Phe Leu Ser Arg Ala Pro Thr Ser Leu Pro
20 25 30

Leu Lys Asn His Lys Leu Asn Val Val Ala Ala Leu Gln Gln Asp Ser
35 40 45

Ser Asn Asp Val Val Pro Ser Gly Asp Arg Leu Ser Arg Pro Lys Ser
50 55 60

Arg Ala Leu Ser Phe Thr Gly Glu Lys Pro Pro Ile Pro Ile Leu Asp
65 70 75 80

Thr Ile Asn Tyr Pro Asn His Met Lys Asn Leu Ser Val Glu Glu Leu
85 90 95

Ala Asn Leu Ala Asp Glu Leu Arg Glu Glu Ile Val Tyr Thr Val Ser
100 105 110

Lys Thr Gly Gly His Leu Ser Ser Ser Leu Gly Val Ser Glu Leu Thr
115 120 125

Val Ala Leu His His Val Phe Asn Thr Pro Asp Asp Lys Ile Ile Trp
130 135 140

Asp Val Gly His Gln Ala Tyr Pro His Lys Ile Leu Thr Gly Arg Arg
145 150 155 160

Ala Arg Met His Thr Ile Arg Gln Thr Phe Gly Leu Ala Gly Phe Pro
165 170 175

Lys Arg Asp Glu Ser Ala His Asp Ala Phe Gly Ala Gly His Ser Ser
180 185 190

Thr Ser Ile Ser Ala Gly Leu Gly Met Ala Val Ala Arg Asp Leu Leu
195 200 205

Gln Lys Asn Asn His Val Ile Ser Val Ile Gly Asp Gly Ala Met Thr
210 215 220

Ala Gly Gln Ala Tyr Glu Ala Leu Asn Asn Ala Gly Phe Leu Asp Ser
225 230 235 240

Asn Leu Ile Ile Val Leu Asn Asp Asn Lys Gln Val Ser Leu Pro Thr
245 250 255

Ala Thr Val Asp Gly Pro Ala Pro Pro Val Gly Ala Leu Ser Lys Ala
260 265 270

Leu Thr Lys Leu Gln Ala Ser Arg Lys Phe Arg Gln Leu Arg Glu Ala
275 280 285

Ala Lys Ser Met Thr Lys Gln Met Gly Ala Pro Ala His Glu Ile Ala
290 295 300

Ser Lys Leu Thr Gln Tyr Val Lys Gly Met Met Gly Lys Pro Gly Ala
305 310 315 320

Ser Leu Phe Glu Glu Leu Gly Ile Tyr Tyr Ile Gly Pro Val Asp Val
325 330 335

Glu Asp Leu Val Tyr Ile Phe Lys Lys Val Lys Glu Met Pro Ala Pro
340 345 350

Gly Pro Val Leu Ile His Ile Ile Thr Glu Lys Gly Lys Gly Tyr Pro
355 360 365

Pro Ala Glu Ile Ala Ala Asp Lys Met His Gly Val Val Lys Phe Asp
370 375 380

Ala Lys Thr Gly Lys Gln Met Lys Thr Lys Asn Lys Thr Lys Ser Tyr
385 390 395 400

Thr Gln Tyr Phe Ala Glu Ser Leu Val Ala Glu Ala Glu His Asp Asp
405 410 415

Lys Ile Val Ala Ile His Ala Ala Met Gly Gly Gly Thr Gly Leu Asn
420 425 430

Ile Phe Gln Lys Gln Phe Pro Asp Arg Cys Phe Asp Val Gly Ile Ala
435 440 445

Glu Gln His Ala Val Thr Phe Ala Ala Gly Met Ala Ala Glu Gly Leu
450 455 460

Lys Pro Phe Cys Ala Ile Tyr Ser Ser Phe Leu Gln Arg Gly Tyr Asp
465 470 475 480

Gln Val Val His Asp Val Asp Leu Gln Lys Leu Pro Val Arg Phe Met
485 490 495

Met Asp Arg Ala Gly Val Val Gly Ala Asp Gly Pro Thr His Cys Gly
500 505 510

Ala Phe Asp Thr Thr Tyr Met Ala Cys Leu Pro Asn Met Val Val Met
515 520 525

Ala Pro Ser Asp Glu Ala Glu Leu Met Asn Met Ile Ala Thr Ala Ala
530 535 540

Ile Ile Asp Asp Arg Pro Ser Cys Val Arg Tyr Pro Arg Gly Asn Gly
545 550 555 560

Ile Gly Val Ala Leu Pro Ser Asn Asn Lys Gly Thr Pro Leu Glu Ile
565 570 575

Gly Lys Gly Arg Ile Leu Lys Glu Gly Ser Lys Val Ala Ile Leu Gly
580 585 590

Phe Gly Thr Ile Val Gln Asn Cys Met Ala Ala Ala Asn Leu Leu Glu
595 600 605

Gln His Gly Ile Ser Val Thr Val Ala Asp Ala Arg Phe Cys Lys Pro
610 615 620

Leu Asp Gly Asp Leu Ile Lys Lys Leu Val Gln Glu His Glu Val Leu
625 630 635 640

Ile Thr Val Glu Glu Gly Ser Ile Gly Gly Phe Ser Ala His Ile Ser
645 650 655

His Phe Leu Ser Leu Asn Gly Leu Leu Asp Gly Asn Leu Lys Trp Arg
660 665 670

Pro Met Val Leu Pro Asp Arg Tyr Ile Asp His Gly Ala Gln Ser Asp
675 680 685

Gln Ile Glu Glu Ala Gly Leu Ser Pro Lys His Ile Ala Gly Thr Val
690 695 700

Val Ser Leu Ile Gly Gly Gly Lys Asp Ser Leu His Leu Ile Asn Asn
705 710 715 720

Leu



<210> 5 
<211> 2509 
<212> DNA 

<213> Mentha piperita 

<220> 

<221> CDS 

<222> (85)..(2256) 

<400> 5 

ttctctctca acacctcttt cctttcttca ctagctactt aatttagttc aagaaagaaa 60 



gagagagaaa gaagagagga agag atg gca tct tcc tgt gga gtt atc aag 111
Met Ala Ser Ser Cys Gly Val Ile Lys
1 5

agc agt ttc ttg cca tcg ctc cat tct gag gat tct acc ttc tta tca 159
Ser Ser Phe Leu Pro Ser Leu His Ser Glu Asp Ser Thr Phe Leu Ser
10 15 20 25

cgt gct cct act tct ctt ccc ctc aaa aat cat aag tta aat gtg gta 207
Arg Ala Pro Thr Ser Leu Pro Leu Lys Asn His Lys Leu Asn Val Val
30 35 40

gca gct ctc caa caa gat agt tcg aac gac gtc gtt cct agc gga gac 255
Ala Ala Leu Gln Gln Asp Ser Ser Asn Asp Val Val Pro Ser Gly Asp
45 50 55

agg ctg agc cgg ccg aaa tca aga gca ctg agt ttc acc gga gag aag 303
Arg Leu Ser Arg Pro Lys Ser Arg Ala Leu Ser Phe Thr Gly Glu Lys
60 65 70

cct ccc att cct ata ctg gac acc atc aac tac cct aat cac atg aaa 351
Pro Pro Ile Pro Ile Leu Asp Thr Ile Asn Tyr Pro Asn His Met Lys
75 80 85

aat ctt tcc gtc gag gaa ctc gca aac cta gct gat gaa ctg agg gaa 399
Asn Leu Ser Val Glu Glu Leu Ala Asn Leu Ala Asp Glu Leu Arg Glu
90 95 100 105

gag ata gtg tac acg gtg tcg aaa acc ggc ggc cat ctt agc tcg agc 447
Glu Ile Val Tyr Thr Val Ser Lys Thr Gly Gly His Leu Ser Ser Ser
110 115 120

cta ggc gtg tcg gag ctc acc gtc gca ctt cat cac gtt ttc aac acg 495
Leu Gly Val Ser Glu Leu Thr Val Ala Leu His His Val Phe Asn Thr
125 130 135

ccc gat gac aaa atc atc tgg gac gtc ggc cac cag gct tac cca cac 543
Pro Asp Asp Lys Ile Ile Trp Asp Val Gly His Gln Ala Tyr Pro His
140 145 150

aaa atc ttg acc ggg aga aga gcg aga atg cac aca att agg cag aca 591
Lys Ile Leu Thr Gly Arg Arg Ala Arg Met His Thr Ile Arg Gln Thr
155 160 165

ttt gga tta gca ggt ttc cct aag aga gat gaa agt gct cac gat gcc 639
Phe Gly Leu Ala Gly Phe Pro Lys Arg Asp Glu Ser Ala His Asp Ala
170 175 180 185

ttc ggc gcc ggc cat agt tct acc agt att tct gct ggt tta ggg atg 687
Phe Gly Ala Gly His Ser Ser Thr Ser Ile Ser Ala Gly Leu Gly Met
190 195 200

gcg gtg gcg aga gat tta ctg cag aag aac aac cac gtc ata tcg gtg 735
Ala Val Ala Arg Asp Leu Leu Gln Lys Asn Asn His Val Ile Ser Val
205 210 215

atc ggc gac ggc gcc atg aca gct gga caa gcg tac gag gcc tta aac 783
Ile Gly Asp Gly Ala Met Thr Ala Gly Gln Ala Tyr Glu Ala Leu Asn
220 225 230

aac gcc gga ttc ctc gat tcg aat ctc ata atc gtt tta aac gac aac 831
Asn Ala Gly Phe Leu Asp Ser Asn Leu Ile Ile Val Leu Asn Asp Asn
235 240 245

aag cag gtg tct cta ccc acg gcc acc gtc gac ggc cct gcg ccg ccg 879
Lys Gln Val Ser Leu Pro Thr Ala Thr Val Asp Gly Pro Ala Pro Pro
250 255 260 265

gtc gga gct ctg agc aaa gcc ctg acc aag ctg caa gcc agc aga aaa 927
Val Gly Ala Leu Ser Lys Ala Leu Thr Lys Leu Gln Ala Ser Arg Lys
270 275 280

ttc cgg cag ctc cgc gaa gca gca aag agt atg act aag cag atg gga 975
Phe Arg Gln Leu Arg Glu Ala Ala Lys Ser Met Thr Lys Gln Met Gly
285 290 295

gca ccg gca cat gaa ata gct tcg aag ttg aca caa tac gtg aaa ggg 1023
Ala Pro Ala His Glu Ile Ala Ser Lys Leu Thr Gln Tyr Val Lys Gly
300 305 310

atg atg ggg aaa cca ggc gct tca ctt ttc gaa gaa ctg ggg att tat 1071
Met Met Gly Lys Pro Gly Ala Ser Leu Phe Glu Glu Leu Gly Ile Tyr
315 320 325

tac atc gga cca gtc gac ggc cat aac gtt gaa gat ctt gtt tat att 1119
Tyr Ile Gly Pro Val Asp Gly His Asn Val Glu Asp Leu Val Tyr Ile
330 335 340 345

ttc aag aaa gtt aag gaa atg cct gcg cct ggg cct gtt ctt att cat 1167
Phe Lys Lys Val Lys Glu Met Pro Ala Pro Gly Pro Val Leu Ile His
350 355 360

atc atc acc gaa aaa ggc aaa ggc tac ccc cct gca gaa att gct gcc 1215
Ile Ile Thr Glu Lys Gly Lys Gly Tyr Pro Pro Ala Glu Ile Ala Ala
365 370 375

gac aaa atg cat ggg gtg gtg aag ttt gat gcg aaa act ggg aaa cag 1263
Asp Lys Met His Gly Val Val Lys Phe Asp Ala Lys Thr Gly Lys Gln
380 385 390

atg aag acg aag aac aag acg aag tca tac acc cag tac ttc gcc gag 1311
Met Lys Thr Lys Asn Lys Thr Lys Ser Tyr Thr Gln Tyr Phe Ala Glu
395 400 405

tct ctg gtg gcg gag gcg gag cac gac gac aag atc gtg gcg atc cac 1359
Ser Leu Val Ala Glu Ala Glu His Asp Asp Lys Ile Val Ala Ile His
410 415 420 425

gcc gcc atg ggg ggc ggc acc ggg ctc aac atc ttc cag aag cag ttc 1407
Ala Ala Met Gly Gly Gly Thr Gly Leu Asn Ile Phe Gln Lys Gln Phe
430 435 440

ccg gac cgg tgc ttc gac gtc ggg atc gcg gag cag cac gcg gtg acg 1455
Pro Asp Arg Cys Phe Asp Val Gly Ile Ala Glu Gln His Ala Val Thr
445 450 455

ttc gcc gcc ggt atg gcg gcg gag ggg ctg aag cct ttc tgc gcc atc 1503
Phe Ala Ala Gly Met Ala Ala Glu Gly Leu Lys Pro Phe Cys Ala Ile
460 465 470

tac tcc tcc ttc ctc cag agg ggc tac gac cag gtg gtc cac gac gtc 1551
Tyr Ser Ser Phe Leu Gln Arg Gly Tyr Asp Gln Val Val His Asp Val
475 480 485

gac ctc cag aag ctc ccg gtc cgg ttc atg atg gat cgg gca gga gtc 1599
Asp Leu Gln Lys Leu Pro Val Arg Phe Met Met Asp Arg Ala Gly Val
490 495 500 505

gtc ggc gcc gac ggc ccc acc cac tgc ggc gcc ttc gac acc acc tac 1647
Val Gly Ala Asp Gly Pro Thr His Cys Gly Ala Phe Asp Thr Thr Tyr
510 515 520

atg gcc tgc ctc ccc aac atg gtg gtc atg gct ccc tcc gac gaa gcg 1695
Met Ala Cys Leu Pro Asn Met Val Val Met Ala Pro Ser Asp Glu Ala
525 530 535

gag ctc atg aac atg atc gcc acc gcc gcc atc atc gac gac cga cct 1743
Glu Leu Met Asn Met Ile Ala Thr Ala Ala Ile Ile Asp Asp Arg Pro
540 545 550

agc tgc gtc cgg tac cct aga ggg aac ggc atc ggc gtc gct ctt ccg 1791
Ser Cys Val Arg Tyr Pro Arg Gly Asn Gly Ile Gly Val Ala Leu Pro
555 560 565

tcg aac aac aaa gga act cca tta gag att ggt aag gga aga atc ttg 1839
Ser Asn Asn Lys Gly Thr Pro Leu Glu Ile Gly Lys Gly Arg Ile Leu
570 575 580 585

aag gag ggg agc aaa gtt gcg att ctg gga ttc gga acc ata gtg cag 1887
Lys Glu Gly Ser Lys Val Ala Ile Leu Gly Phe Gly Thr Ile Val Gln
590 595 600

aac tgc atg gcg gcg gcg aat ctt ctc gaa caa cac gga atc tca gta 1935
Asn Cys Met Ala Ala Ala Asn Leu Leu Glu Gln His Gly Ile Ser Val
605 610 615

aca gta gcc gat gca aga ttc tgc aag cca ctc gat ggg gat ttg ata 1983
Thr Val Ala Asp Ala Arg Phe Cys Lys Pro Leu Asp Gly Asp Leu Ile
620 625 630

aag aaa ctg gtg cag gag cat gaa gta ctc atc act gtt gaa gaa gga 2031
Lys Lys Leu Val Gln Glu His Glu Val Leu Ile Thr Val Glu Glu Gly
635 640 645

tcc atc ggt gga ttc agt gct cac att tct cat ttc ttg tcc ctc aat 2079
Ser Ile Gly Gly Phe Ser Ala His Ile Ser His Phe Leu Ser Leu Asn
650 655 660 665

ggc ttg ctc gat gga aac ctc aag tgg agg cca atg gtt ctt cca gat 2127
Gly Leu Leu Asp Gly Asn Leu Lys Trp Arg Pro Met Val Leu Pro Asp
670 675 680

agg tac att gat cat gga gca cag agt gat caa ata gaa gaa gca ggg 2175
Arg Tyr Ile Asp His Gly Ala Gln Ser Asp Gln Ile Glu Glu Ala Gly
685 690 695

ctg agt cct aag cat att gca ggg act gtt gtt tca ttg att gga gga 2223
Leu Ser Pro Lys His Ile Ala Gly Thr Val Val Ser Leu Ile Gly Gly
700 705 710

gga aag gac agt ctt cat ttg att aat aat ttg taatattatt ttaatttatt 2276
Gly Lys Asp Ser Leu His Leu Ile Asn Asn Leu
715 720

tcttcgaaaa ggaaaagaga aaaaaatgga gtctgaattt gagcagctgc aaaaattctc 2336 

catgagagat tagtgttaag atgtataatg taaatatggg ggaaggtcaa gactcttgac 2396 

ccatggaaat tgggggagct gttctaaata attgttgtga tggcagcctt ttctacatgt 2456 

tttattcaat aaaatcattt gttacatttt aaaaaaaaaa aaaaaaaaaa aaa 2509 



<210> 6 

<211> 724 

<212> PRT 

<213> Mentha piperita 

<400> 6 

Met Ala Ser Ser Cys Gly Val Ile Lys Ser Ser Phe Leu Pro Ser Leu
1 5 10 15

His Ser Glu Asp Ser Thr Phe Leu Ser Arg Ala Pro Thr Ser Leu Pro
20 25 30

Leu Lys Asn His Lys Leu Asn Val Val Ala Ala Leu Gln Gln Asp Ser
35 40 45

Ser Asn Asp Val Val Pro Ser Gly Asp Arg Leu Ser Arg Pro Lys Ser
50 55 60

Arg Ala Leu Ser Phe Thr Gly Glu Lys Pro Pro Ile Pro Ile Leu Asp
65 70 75 80

Thr Ile Asn Tyr Pro Asn His Met Lys Asn Leu Ser Val Glu Glu Leu
85 90 95

Ala Asn Leu Ala Asp Glu Leu Arg Glu Glu Ile Val Tyr Thr Val Ser
100 105 110

Lys Thr Gly Gly His Leu Ser Ser Ser Leu Gly Val Ser Glu Leu Thr
115 120 125

Val Ala Leu His His Val Phe Asn Thr Pro Asp Asp Lys Ile Ile Trp
130 135 140

Asp Val Gly His Gln Ala Tyr Pro His Lys Ile Leu Thr Gly Arg Arg
145 150 155 160

Ala Arg Met His Thr Ile Arg Gln Thr Phe Gly Leu Ala Gly Phe Pro
165 170 175

Lys Arg Asp Glu Ser Ala His Asp Ala Phe Gly Ala Gly His Ser Ser
180 185 190

Thr Ser Ile Ser Ala Gly Leu Gly Met Ala Val Ala Arg Asp Leu Leu
195 200 205

Gln Lys Asn Asn His Val Ile Ser Val Ile Gly Asp Gly Ala Met Thr
210 215 220

Ala Gly Gln Ala Tyr Glu Ala Leu Asn Asn Ala Gly Phe Leu Asp Ser
225 230 235 240

Asn Leu Ile Ile Val Leu Asn Asp Asn Lys Gln Val Ser Leu Pro Thr
245 250 255

Ala Thr Val Asp Gly Pro Ala Pro Pro Val Gly Ala Leu Ser Lys Ala
260 265 270

Leu Thr Lys Leu Gln Ala Ser Arg Lys Phe Arg Gln Leu Arg Glu Ala
275 280 285

Ala Lys Ser Met Thr Lys Gln Met Gly Ala Pro Ala His Glu Ile Ala
290 295 300

Ser Lys Leu Thr Gln Tyr Val Lys Gly Met Met Gly Lys Pro Gly Ala
305 310 315 320

Ser Leu Phe Glu Glu Leu Gly Ile Tyr Tyr Ile Gly Pro Val Asp Gly
325 330 335

His Asn Val Glu Asp Leu Val Tyr Ile Phe Lys Lys Val Lys Glu Met
340 345 350

Pro Ala Pro Gly Pro Val Leu Ile His Ile Ile Thr Glu Lys Gly Lys
355 360 365

Gly Tyr Pro Pro Ala Glu Ile Ala Ala Asp Lys Met His Gly Val Val
370 375 380

Lys Phe Asp Ala Lys Thr Gly Lys Gln Met Lys Thr Lys Asn Lys Thr
385 390 395 400

Lys Ser Tyr Thr Gln Tyr Phe Ala Glu Ser Leu Val Ala Glu Ala Glu
405 410 415

His Asp Asp Lys Ile Val Ala Ile His Ala Ala Met Gly Gly Gly Thr
420 425 430

Gly Leu Asn Ile Phe Gln Lys Gln Phe Pro Asp Arg Cys Phe Asp Val
435 440 445

Gly Ile Ala Glu Gln His Ala Val Thr Phe Ala Ala Gly Met Ala Ala
450 455 460

Glu Gly Leu Lys Pro Phe Cys Ala Ile Tyr Ser Ser Phe Leu Gln Arg
465 470 475 480

Gly Tyr Asp Gln Val Val His Asp Val Asp Leu Gln Lys Leu Pro Val
485 490 495

Arg Phe Met Met Asp Arg Ala Gly Val Val Gly Ala Asp Gly Pro Thr
500 505 510

His Cys Gly Ala Phe Asp Thr Thr Tyr Met Ala Cys Leu Pro Asn Met
515 520 525

Val Val Met Ala Pro Ser Asp Glu Ala Glu Leu Met Asn Met Ile Ala
530 535 540

Thr Ala Ala Ile Ile Asp Asp Arg Pro Ser Cys Val Arg Tyr Pro Arg
545 550 555 560

Gly Asn Gly Ile Gly Val Ala Leu Pro Ser Asn Asn Lys Gly Thr Pro
565 570 575

Leu Glu Ile Gly Lys Gly Arg Ile Leu Lys Glu Gly Ser Lys Val Ala
580 585 590

Ile Leu Gly Phe Gly Thr Ile Val Gln Asn Cys Met Ala Ala Ala Asn
595 600 605

Leu Leu Glu Gln His Gly Ile Ser Val Thr Val Ala Asp Ala Arg Phe
610 615 620

Cys Lys Pro Leu Asp Gly Asp Leu Ile Lys Lys Leu Val Gln Glu His
625 630 635 640

Glu Val Leu Ile Thr Val Glu Glu Gly Ser Ile Gly Gly Phe Ser Ala
645 650 655

His Ile Ser His Phe Leu Ser Leu Asn Gly Leu Leu Asp Gly Asn Leu
660 665 670

Lys Trp Arg Pro Met Val Leu Pro Asp Arg Tyr Ile Asp His Gly Ala
675 680 685

Gln Ser Asp Gln Ile Glu Glu Ala Gly Leu Ser Pro Lys His Ile Ala
690 695 700

Gly Thr Val Val Ser Leu Ile Gly Gly Gly Lys Asp Ser Leu His Leu
705 710 715 720

Ile Asn Asn Leu



<210> 7 
<211> 2561 
<212> DNA 

<213> Mentha piperita 

<220> 
<221> CDS 

<222> (128)..(2308) 
<400> 7 

ctttctctct caacacttct ttcctttctt cactagctac ttaatttagt ccaagaaaga 60 

aagagaaaga gagagagaga gagagagaga gagaggagga gagagagaga gagagagaga 120 

gagagag atg gca tct tcc tgt gga gtt atc aag agc agt ttc ttg cca 169
Met Ala Ser Ser Cys Gly Val Ile Lys Ser Ser Phe Leu Pro
1 5 10

tcg ctc cat tct gag gat tct acc ttc tta tca cgt gct cct act tct 217
Ser Leu His Ser Glu Asp Ser Thr Phe Leu Ser Arg Ala Pro Thr Ser
15 20 25 30

ctt ccc ctc aaa aat cat aag tta aat gtg gta gca gct ctc caa caa 265
Leu Pro Leu Lys Asn His Lys Leu Asn Val Val Ala Ala Leu Gln Gln
35 40 45

gat agt tcg aac gac gac gtc gtt cct agc gga gac agg ctg agc cgg 313
Asp Ser Ser Asn Asp Asp Val Val Pro Ser Gly Asp Arg Leu Ser Arg
50 55 60

ccg aaa tca aga gca ctg agt ttc acc gga gag aag cct ccc att cct 361
Pro Lys Ser Arg Ala Leu Ser Phe Thr Gly Glu Lys Pro Pro Ile Pro
65 70 75

ata ctg gac acc atc aac tac cct aat cac atg aaa aat ctt tcc gtc 409
Ile Leu Asp Thr Ile Asn Tyr Pro Asn His Met Lys Asn Leu Ser Val
80 85 90

gag gaa ctc gca aac cta gct gat gaa ctg agg gaa gag ata gtg tac 457
Glu Glu Leu Ala Asn Leu Ala Asp Glu Leu Arg Glu Glu Ile Val Tyr
95 100 105 110

acg gtg tcg aaa acc ggc ggc cat ctt agc tcg agc cta ggc gtg tcg 505
Thr Val Ser Lys Thr Gly Gly His Leu Ser Ser Ser Leu Gly Val Ser
115 120 125

gag ctc acc gtc gca ctt cat cac gtt ttc aac acg ccc gat gac aaa 553
Glu Leu Thr Val Ala Leu His His Val Phe Asn Thr Pro Asp Asp Lys
130 135 140

atc atc tgg gac gtc ggc cac cag gct tac cca cac aaa atc ttg acc 601
Ile Ile Trp Asp Val Gly His Gln Ala Tyr Pro His Lys Ile Leu Thr
145 150 155

ggg aga aga gcg aga atg cac aca att agg cag aca ttt gga tta gca 649
Gly Arg Arg Ala Arg Met His Thr Ile Arg Gln Thr Phe Gly Leu Ala
160 165 170

ggt ttc cct aag aga gat gaa agt gct cac gat gcc ttc ggc gcc ggc 697
Gly Phe Pro Lys Arg Asp Glu Ser Ala His Asp Ala Phe Gly Ala Gly
175 180 185 190

cat agt tct acc agt att tct gct ggt tta ggg atg gcg gtg gcg aga 745
His Ser Ser Thr Ser Ile Ser Ala Gly Leu Gly Met Ala Val Ala Arg
195 200 205

gat tta ctg cag aag aac aac cac gtc ata tcg gtg atc ggc gac ggc 793
Asp Leu Leu Gln Lys Asn Asn His Val Ile Ser Val Ile Gly Asp Gly
210 215 220

gcc atg aca gct gga caa gcg tac gag gcc tta aac aac gcc gga ttc 841
Ala Met Thr Ala Gly Gln Ala Tyr Glu Ala Leu Asn Asn Ala Gly Phe
225 230 235

ctc gat tcg aat ctc ata atc gtt tta aac gac aac aag cag gtg tct 889
Leu Asp Ser Asn Leu Ile Ile Val Leu Asn Asp Asn Lys Gln Val Ser
240 245 250

cta ccc acg gcc acc gtc gac ggc cct gcg ccg ccg gtc gga gct ctg 937
Leu Pro Thr Ala Thr Val Asp Gly Pro Ala Pro Pro Val Gly Ala Leu
255 260 265 270

agc aaa gcc ctg acc aag ctg caa gcc agc aga aaa ttc cgg cag ctc 985
Ser Lys Ala Leu Thr Lys Leu Gln Ala Ser Arg Lys Phe Arg Gln Leu

275 280 285 
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cgc 
Arg 


gaa 
Glu 


gca 

TV 1 — 

Ala 


gca 
AJ.a 
290 


aag 
Lys 


agt 
ber 


atg 
fie t 


act 
1 nr 


aag 
Lys 
295 


cag 
^±n 


atg 
ne u 


gga gca 


ecg 
Pro 
300 


gca 

Ala 


cat 
His 


gaa 
Glu 


ata 
He 


get 
Ala 
305 


teg 
Ser 


aag 

Lys 


ttg 
Leu 


aea 
Thr 


caa 
Gin 
310 


tac 
Tyr 


gtg 
Val 


aaa 
Lys 


ggg atg 
Gly Met 
315 


atg 
rieu 


ggg 

Vjxy 


aaa 
Lys 


cca 
Pro 


ggc get 
Gly Ala 
320 


tea 
Ser 


ctt 
Leu 


ttc 
Phe 


gaa 
Glu 
325 


gaa 
Glu 


ctg 
Leu 


ggg 

Gly 


att 

He 


tat 
Tyr 
330 


tac 
Tyr 


ate 


gga 


cca 
rro 


gtc 
Val 
335 


gac 
Asp 


ggc 
Gly 


cat 
His 


aac 
Asn 


gtt 
vaJ. 
340 


gaa 
Glu 


gat 
Asp 


ctt 

T All 

Lieu 


gtt 

\r-. 1 


tat 
Tyr 
345 


att 
He 


ttc 
Phe 


aag 


aaa 
Lys 


gtt 
Val 
350 


aag 
Lys 


gaa 
Glu 


atg 
Met 


cct 
Pro 


geg 
Ala 
355 


cct 
Pro 


ggg 

Gly 


cct 
Pro 


gtt 
vax 


ctt 

Tail 

1*6 U 

360 


att 

T 1 *a 

±xe 


cat 
His 


ate 
He 


ate 

T 1 

J. ^ c 


ace 
Th r 
365 


gaa 
Glu 


aaa 
Lys 


ggc 
Glv 


aaa 
Lvs 


ggc 
Gly 
370 


tac 
Tyr 


ccc 
Pro 


cct 
Pro 


gca 
Ala 


gaa 
Glu 
375 


att 
lie 


get 
Ala 


gee 
Ala 


gac 
Asp 


aaa 

LiyS 

380 


atg 
piet 


cat 
nis 


ggg 

Gly 


gtg 

Val 


gtg 
Val 
385 


aag 
Lys 


ttt 
Phe 


gat 
Asp 


geg 
Ala 


aaa 
Lys 
390 


act 
Thr 


ggg 

Gly 


aaa 
Lys 


cag 
Gin 


atg 
Met 
395 


aag 
Lys 


acg 


aag 
Lys 


aac 
Asn 


aag 
Lys 
400 


acg 
Thr 


aag 
Lys 


tea 
Ser 


tac 
Tyr 


ace 
Thr 
405 


cag 
Gin 


tac 
Tyr 


ttc 
Fne 


gee 
Aia 


gag 

Glu 
410 


tct 
Ser 


ctg 
Leu 


gtg 


geg 


gag 
Glu 
415 


geg 
Ala 


gag 
Glu 


eac 
His 


gac 
Asp 


gac 
Asp 
420 


aag 
Lys 


ate 
He 


gtg 
Val 


geg 
Ala 


ate 
He 
425 


cac 
His 


gee 
Ala 


gee 

AA. a 


atg 

Ua *- 

ne u 


ggg 

t»xy 
430 


ggc 
Gly 


ggc 
Gly 


ace 
Thr 


ggg 

Gly 


etc 
Leu 
435 


aac 
Asn 


ate 
He 


ttc 
Phe 


cag 
Gin 


aag 
Lys 
440 


cag 
G±n 


ttc 
Phe 


ecg 
Pro 


gac 
Asp 


egg 
Arg 
445 


tgc 
Cys 


ttc 


gac 
Asp 


gtc 
Val 


ggg 

Gly 
450 


ate 
lie 


geg 
Ala 


gag 

(jXU 


cag 
Lf±n 


cac 
nxs 
455 


geg 
Axa 


gtg 
V ax 


acg 
Thr 


ttc 
Phe 


gee 
Ala 
460 


gee 
Ala 


ggt 
Gly 


atg 
Met 


geg 
Ala 


geg 
Ala 
465 


gag 

Glu 


ggg 

Gly 


ctg 
Leu 


aag 
Lys 


cct 
Pro 
470 


ttc 

r> V, _ 

fne 


tgc 
Cys 


gee 
/u.a 


ate 
He 


tac 
Tyr 
475 


tec 

•u c ^ 


tec 
Ser 


ttc 
Phe 


etc 
Leu 


cag 
Gin 
480 


agg 
Arg 


ggc 
Gly 


tac 
Tyr 


gac 
Asp 


cag 
Gin 
485 


gtg 
Val 


gtc 
Vaj. 


cae 
nlS 


gac 
Asp 


gtc 
Val 
490 


gac 
Asp 


etc 
Leu 


cag 


aag 
Lys 


etc 
Leu 
495 


ecg 
Pro 


gtc 
Val 


egg 

Arg 


ttc 
Phe 


atg 
Met 
500 


atg 
Met 


gat 
Asp 


egg 
Arg 


gca 
AJ.a 


gga gtc 
Gly Val 
505 


gtc 
Val 


ggc 

\3±. y 


gee 
M_La 


gac 
Asp 
510 


ggc 

Gly 


ccc 
Pro 


ace 
Thr 


eac 
His 


tgc 
Cys 
515 


ggc 
Gly 


gee 
Ala 


ttc 
Phe 


gac 
Asp 


acc 
Thr 
520 


ace 
Thr 


tac 
Tyr 


atg 
Met 


gee 
Ala 


tgc 
Cys 
525 


etc 
Leu 


ccc 
Pro 


aac 
Asn 


atg 
Met 


gtg 
Val 


gtc 
Val 


atg 
Met 


get 
Ala 


ccc 
Pro 


tec 
Ser 


gac 
Asp 


gaa 
Glu 


geg 
Ala 


gag 
Glu 


etc 
Leu 


atg 
Met 


aac 
Asn 



1033 



1081 



1129 



1177 



1225 



1273 



1321 



1369 



1417 



1465 



1513 



1561 



1609 



1657 



1705 



1753 



530 535 540 
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atg ate gcc acc gcc gcc ate ate gae gac cga cet age tge gtc egg 1801 
Met lie Ala Thr Ala Ala lie lie Asp Asp Arg Pro Ser Cys Val Arg 
545 550 555 

tac cct aga ggg aac ggc atc ggc gtc gct ctt ccg tcg aac aac aaa 1849
Tyr Pro Arg Gly Asn Gly Ile Gly Val Ala Leu Pro Ser Asn Asn Lys
560 565 570

gga act cca tta gag att ggt aag gga aga atc ttg aag gag ggg agc 1897
Gly Thr Pro Leu Glu Ile Gly Lys Gly Arg Ile Leu Lys Glu Gly Ser
575 580 585 590

aaa gtt gcg att ctg gga ttc gga acc ata gtg cag aac tgc atg gcg 1945
Lys Val Ala Ile Leu Gly Phe Gly Thr Ile Val Gln Asn Cys Met Ala
595 600 605

gcg gcg aat ctt ctc gaa caa cac gga atc tca gta aca gta gcc gat 1993
Ala Ala Asn Leu Leu Glu Gln His Gly Ile Ser Val Thr Val Ala Asp
610 615 620

gca aga ttc tgc aag cca ctc gat ggg gat ttg ata aag aaa ctg gtg 2041
Ala Arg Phe Cys Lys Pro Leu Asp Gly Asp Leu Ile Lys Lys Leu Val
625 630 635

cag gag cat gaa gta ctc atc act gtt gaa gaa gga tcc atc ggg atc 2089
Gln Glu His Glu Val Leu Ile Thr Val Glu Glu Gly Ser Ile Gly Ile
640 645 650

ggt gga ttc agt gct cac att tct cat ttc ttg tcc ctc aat ggc ttg 2137
Gly Gly Phe Ser Ala His Ile Ser His Phe Leu Ser Leu Asn Gly Leu
655 660 665 670

ctc gat gga aac ctc aag tgg agg cca atg gtt ctt cca gat agg tac 2185
Leu Asp Gly Asn Leu Lys Trp Arg Pro Met Val Leu Pro Asp Arg Tyr
675 680 685

att gat cat gga gca cag agt gat caa ata gaa gaa gca ggg ctg agt 2233
Ile Asp His Gly Ala Gln Ser Asp Gln Ile Glu Glu Ala Gly Leu Ser
690 695 700

cct aag cat att gca ggg act gtt gtt tca ttg att gga gga gga aag 2281
Pro Lys His Ile Ala Gly Thr Val Val Ser Leu Ile Gly Gly Gly Lys
705 710 715

gac agt ctt cat ttg att aat aat ttg taatattatt ttaatttatt 2328
Asp Ser Leu His Leu Ile Asn Asn Leu
720 725

tcttcgaaaa ggaaaagaga aaaaaatgga gtctgaattt gagcagctgc aaaaattctc 2388

catgagagat tagtgttaag atgtataatg taaatatggg ggaaggtcaa gactcttgac 2448

ccatggaaat tgggggagct gttctaaata attgttgtga tggcagcctt ttctacatgt 2508

tttattcaat aaaatcattt gttacatttt aaaaaaaaaa aaaaaaaaaa aaa 2561 
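
The pairing of codon lines and amino acid lines in SEQ ID NO:7 can be checked by translating the nucleotides with the standard genetic code. The following is a minimal sketch, not part of the listing as filed: the names `CODON_TABLE` and `translate` are illustrative, and the input is the final coding line of SEQ ID NO:7 (nucleotides 2282-2308) followed by its stop codon.

```python
# Standard genetic code, built from the conventional t/c/a/g codon ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Translation stops after the first stop codon, which is reported as '*'.
    """
    dna = dna.replace(" ", "").lower()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


# Last nine codons of the open reading frame of SEQ ID NO:7 plus the stop codon.
last_codons = "gac agt ctt cat ttg att aat aat ttg taa"
print(translate(last_codons))  # DSLHLINNL*
```

The one-letter result corresponds to Asp Ser Leu His Leu Ile Asn Asn Leu, the last nine residues (719-727) of SEQ ID NO:8.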

<210> 8 
<211> 727 
<212> PRT 

<213> Mentha piperita 
<400> 8 

Met Ala Ser Ser Cys Gly Val Ile Lys Ser Ser Phe Leu Pro Ser Leu
1 5 10 15

His Ser Glu Asp Ser Thr Phe Leu Ser Arg Ala Pro Thr Ser Leu Pro
20 25 30

Leu Lys Asn His Lys Leu Asn Val Val Ala Ala Leu Gln Gln Asp Ser
35 40 45

Ser Asn Asp Asp Val Val Pro Ser Gly Asp Arg Leu Ser Arg Pro Lys
50 55 60

Ser Arg Ala Leu Ser Phe Thr Gly Glu Lys Pro Pro Ile Pro Ile Leu
65 70 75 80

Asp Thr Ile Asn Tyr Pro Asn His Met Lys Asn Leu Ser Val Glu Glu
85 90 95

Leu Ala Asn Leu Ala Asp Glu Leu Arg Glu Glu Ile Val Tyr Thr Val
100 105 110

Ser Lys Thr Gly Gly His Leu Ser Ser Ser Leu Gly Val Ser Glu Leu
115 120 125

Thr Val Ala Leu His His Val Phe Asn Thr Pro Asp Asp Lys Ile Ile
130 135 140

Trp Asp Val Gly His Gln Ala Tyr Pro His Lys Ile Leu Thr Gly Arg
145 150 155 160

Arg Ala Arg Met His Thr Ile Arg Gln Thr Phe Gly Leu Ala Gly Phe
165 170 175

Pro Lys Arg Asp Glu Ser Ala His Asp Ala Phe Gly Ala Gly His Ser
180 185 190

Ser Thr Ser Ile Ser Ala Gly Leu Gly Met Ala Val Ala Arg Asp Leu
195 200 205

Leu Gln Lys Asn Asn His Val Ile Ser Val Ile Gly Asp Gly Ala Met
210 215 220

Thr Ala Gly Gln Ala Tyr Glu Ala Leu Asn Asn Ala Gly Phe Leu Asp
225 230 235 240

Ser Asn Leu Ile Ile Val Leu Asn Asp Asn Lys Gln Val Ser Leu Pro
245 250 255

Thr Ala Thr Val Asp Gly Pro Ala Pro Pro Val Gly Ala Leu Ser Lys
260 265 270

Ala Leu Thr Lys Leu Gln Ala Ser Arg Lys Phe Arg Gln Leu Arg Glu
275 280 285

Ala Ala Lys Ser Met Thr Lys Gln Met Gly Ala Pro Ala His Glu Ile
290 295 300

Ala Ser Lys Leu Thr Gln Tyr Val Lys Gly Met Met Gly Lys Pro Gly
305 310 315 320

Ala Ser Leu Phe Glu Glu Leu Gly Ile Tyr Tyr Ile Gly Pro Val Asp
325 330 335

Gly His Asn Val Glu Asp Leu Val Tyr Ile Phe Lys Lys Val Lys Glu
340 345 350

Met Pro Ala Pro Gly Pro Val Leu Ile His Ile Ile Thr Glu Lys Gly
355 360 365

Lys Gly Tyr Pro Pro Ala Glu Ile Ala Ala Asp Lys Met His Gly Val
370 375 380

Val Lys Phe Asp Ala Lys Thr Gly Lys Gln Met Lys Thr Lys Asn Lys
385 390 395 400

Thr Lys Ser Tyr Thr Gln Tyr Phe Ala Glu Ser Leu Val Ala Glu Ala
405 410 415

Glu His Asp Asp Lys Ile Val Ala Ile His Ala Ala Met Gly Gly Gly
420 425 430

Thr Gly Leu Asn Ile Phe Gln Lys Gln Phe Pro Asp Arg Cys Phe Asp
435 440 445

Val Gly Ile Ala Glu Gln His Ala Val Thr Phe Ala Ala Gly Met Ala
450 455 460

Ala Glu Gly Leu Lys Pro Phe Cys Ala Ile Tyr Ser Ser Phe Leu Gln
465 470 475 480

Arg Gly Tyr Asp Gln Val Val His Asp Val Asp Leu Gln Lys Leu Pro
485 490 495

Val Arg Phe Met Met Asp Arg Ala Gly Val Val Gly Ala Asp Gly Pro
500 505 510

Thr His Cys Gly Ala Phe Asp Thr Thr Tyr Met Ala Cys Leu Pro Asn
515 520 525

Met Val Val Met Ala Pro Ser Asp Glu Ala Glu Leu Met Asn Met Ile
530 535 540

Ala Thr Ala Ala Ile Ile Asp Asp Arg Pro Ser Cys Val Arg Tyr Pro
545 550 555 560

Arg Gly Asn Gly Ile Gly Val Ala Leu Pro Ser Asn Asn Lys Gly Thr
565 570 575

Pro Leu Glu Ile Gly Lys Gly Arg Ile Leu Lys Glu Gly Ser Lys Val
580 585 590

Ala Ile Leu Gly Phe Gly Thr Ile Val Gln Asn Cys Met Ala Ala Ala
595 600 605

Asn Leu Leu Glu Gln His Gly Ile Ser Val Thr Val Ala Asp Ala Arg
610 615 620

Phe Cys Lys Pro Leu Asp Gly Asp Leu Ile Lys Lys Leu Val Gln Glu
625 630 635 640

His Glu Val Leu Ile Thr Val Glu Glu Gly Ser Ile Gly Ile Gly Gly
645 650 655

Phe Ser Ala His Ile Ser His Phe Leu Ser Leu Asn Gly Leu Leu Asp
660 665 670

Gly Asn Leu Lys Trp Arg Pro Met Val Leu Pro Asp Arg Tyr Ile Asp
675 680 685

His Gly Ala Gln Ser Asp Gln Ile Glu Glu Ala Gly Leu Ser Pro Lys
690 695 700

His Ile Ala Gly Thr Val Val Ser Leu Ile Gly Gly Gly Lys Asp Ser
705 710 715 720

Leu His Leu Ile Asn Asn Leu
725 



<210> 9 
<211> 617 
<212> DNA 

<213> Mentha piperita 
<400> 9 



aagagcgaga atgcacacaa ttaggcagac atttggatta gcaggtttcc ctaagagaga 60

tgaaagtgct cacgatgcct tcggcgccgg ccatagttct accagtattt ctgctggttt 120

agggatggcg gtggcgagag atttactgca gaagaacaac cacgtcatat cggtgatcgg 180

cgacggcgcc atgacagctg gacaagcgta cgaggcctta aacaacgccg gattcctcga 240

ttcgaatctc ataatcgttt taaacgacaa caagcaggtg tctctaccca cggccaccgt 300

cgacggccct gcgccgccgg tcggagctct gagcaaagcc ctgaccaagc tgcaagccag 360

cagaaaattc cggcagctcc gcgaagcagc aaagagtatg actaagcaga tgggagcacc 420

ggcacatgaa atagcttcga agttgacaca atacgtgaaa gggatgatgg ggaaaccagg 480

cgcttcactt ttcgaagaac tggggattta ttacatcgga ccagtcgacg gccataacgt 540

tgaagatctt gtttatattt tcaagaaagt taaggaaatg cctgcgcctg ggcctgttct 600

tattcatatc atcaccg 617



<210> 10 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 

transketolase motif wherein Xaa at position 4 represents a sequence of 7 to 8 
amino acids, Xaa at position 6 represents a sequence of 3 to 4 amino acids, and 
Xaa at position 8 represents a sequence of 11 to 13 amino acids

<220>

<221> DOMAIN
<222> (1)..(10)

<400> 10

Gly Asp Gly Xaa Glu Xaa Ala Xaa Asn Asn
1 5 10



<210> 11
<211> 24
<212> PRT

<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: transketolase

motif from Mentha piperita wherein Xaa at position 3 represents Ala or Ser,
Xaa at position 4 represents any amino acid, Xaa at position 6 represents Ala
or Gly, Xaa at position 8 represents Gln or Met, Xaa at position 10 represents
any amino acid, Xaa at position 13 represents any amino acid, Xaa at
position 15 represents Asn or His, Xaa at position 18 represents a sequence of
7 to 8 amino acids, Xaa at position 19 represents Ile or Val and Xaa at
position 20 represents Ile or Val

<220>

<221> DOMAIN
<222> (1)..(24)

<400> 11

Asp Gly Xaa Xaa Thr Xaa Gly Xaa Ala Xaa Glu Ala Xaa Asn Xaa Ala
1 5 10 15

Gly Xaa Xaa Xaa Leu Asn Asp Asn 
20 
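
The Xaa definitions given for SEQ ID NO:10 and SEQ ID NO:11 can be written as regular expressions over one-letter amino acid codes, with each variable-length Xaa becoming a bounded wildcard. The following is a minimal sketch, not part of the listing as filed: the names `CONSENSUS_TK`, `MINT_TK`, `FRAGMENT` and `find_motif` are illustrative, and `FRAGMENT` is a hand transcription of residues 209-256 of SEQ ID NO:8 into one-letter code, which is an assumption to be verified against the listing.

```python
import re

# SEQ ID NO:10: Gly Asp Gly Xaa(7-8) Glu Xaa(3-4) Ala Xaa(11-13) Asn Asn
CONSENSUS_TK = re.compile(r"GDG.{7,8}E.{3,4}A.{11,13}NN")

# SEQ ID NO:11, position by position:
# Asp Gly [Ala/Ser] any Thr [Ala/Gly] Gly [Gln/Met] Ala any Glu Ala any
# Asn [Asn/His] Ala Gly Xaa(7-8) [Ile/Val] [Ile/Val] Leu Asn Asp Asn
MINT_TK = re.compile(r"DG[AS].T[AG]G[QM]A.EA.N[NH]AG.{7,8}[IV][IV]LNDN")

# Assumption: residues 209-256 of SEQ ID NO:8, transcribed by hand.
FRAGMENT_START = 209
FRAGMENT = "LQKNNHVISVIGDGAMTAGQAYEALNNAGFLDSNLIIVLNDNKQVSLP"


def find_motif(pattern, seq, first_residue=1):
    """Return (start, end, matched text) using 1-based residue numbers,
    or None when the pattern does not occur in seq."""
    match = pattern.search(seq)
    if match is None:
        return None
    start = match.start() + first_residue
    end = match.end() + first_residue - 1
    return (start, end, match.group())


hit = find_motif(MINT_TK, FRAGMENT, FRAGMENT_START)
print(hit)  # (221, 250, 'DGAMTAGQAYEALNNAGFLDSNLIIVLNDN')
```

Under these assumptions the SEQ ID NO:11 pattern locates Asp221 through Asn250 of SEQ ID NO:8, with the variable Xaa at position 18 spanning seven residues.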

<210> 12 
<211> 1400 
<212> DNA 

<213> Mentha piperita 
<400> 12 



gagttatcaa gagcagtttc ttgccatcgc tccattctga ggattctacc ttcttatcac 60

gtgctcctac ttctcttccc ctcaaaaatc ataagttaaa tgtggtagca gctctccaac 120

aagatagttc gaacgacgtc gttcctagcg gagacaggct gagccggccg aaatcaagag 180

cactgagttt caccggagag aagcctccca ttcctatact ggacaccatc aactacccta 240

atcacatgaa aaatctttcc gtcgaggaac tcgcaaacct agctgatgaa ctgagggaag 300

agatagtgta cacggtgtcg aaaaccggcg gccatcttag ctcgagccta ggcgtgtcgg 360

agctcaccgt cgcacttcat cacgttttca acacgcccga tgacaaaatc atctgggacg 420

tcggccacca ggcttaccca cacaaaatct tgaccgggag aagagcgaga atgcacacaa 480

ttaggcagac atttggatta gcaggtttcc ctaagagaga tgaaagtgct cacgatgcct 540

tcggcgccgg ccatagttct accagtattt ctgctggttt agggatggcg gtggcgagag 600

atttactgca gaagaacaac cacgtcatat cggtgatcgg cgacggcgcc atgacagctg 660

gacaagcgta cgaggcctta aacaacgccg gattcctcga ttcgaatctc ataatcgttt 720

taaacgacaa caagcaggtg tctctaccca cggccaccgt cgacggccct gcgccgccgg 780

tcggagctct gagcaaagcc ctgaccaagc tgcaagccag cagaaaattc cggcagctcc 840

gcgaagcagc aaagagtatg actaagcaga tgggagcacc ggcacatgaa atagcttcga 900

agttgacaca atacgtgaaa gggatgatgg ggaaaccagg cgcttcactt ttcgaagaac 960

tggggattta ttacatcgga ccagtcgacg gccataacgt tgaagatctt gtttatattt 1020

tcaagaaagt taaggaaatg cctgcgcctg ggcctgttct tattcatatc atcaccgaaa 1080

aaggcaaagg ctacccccct gcagaaattg ctgccgacaa aatgcatggg gtggtgaagt 1140

ttgatgcgaa aactgggaaa cagatgaaga cgaagaacaa gacgaagtca tacacccagt 1200

acttcgccga gtctctggtg gcggaggcgg agcacgacga caagatcgtg gcgatccacg 1260

ccgccatggg gggcggcacc gggctcaaca tcttccagaa gcagttcccg gaccggtgct 1320

tcgacgtcgg gatcgcggag cagcacgcgg tgacgttcgc cgccggtatg gcggcggagg 1380

ggctgaagcc tttctgcgcc 1400

PCT/US98/18231

inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees
must be paid.

Group I, claim(s) 1-7, 13-18, drawn to DNA, vectors, host cells, method of altering DXPS activity.
Group II, claim(s) 8-12, drawn to protein.

The inventions listed as Groups I and II do not relate to a single inventive concept under PCT Rule 13.1 because, under
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons:

The broad claims read on isolated nucleic acid molecules which encompass the Arabidopsis DXPS nucleic acid
disclosed by Mandel (Plant J. 9: 649-658, 1996), particularly in view of the language "at least a functional portion of" or
"at least a portion of." Therefore, there is no special technical feature which links the DNA and the protein molecules,
and because the DNA and protein molecules differ in composition, structure, and function, the two groups do not form a
single inventive concept under PCT Rule 13.1.